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© Mapping digital data sequences. 

© In a so-called trellis shaping modulation system, instead of mapping a digital data sequence into a sequence 
of signal points drawn from a finite-dimensional signal point constellation as in typical coded modulation, the 
data sequence is mapped into a sequence of signal points drawn from a so-called trellis constellation of 
available infinite-dimensional sequences so that there is no single N-dimensional constellation used repeatedly in 
each N dimensions. The region occupied by the available sequences in sequence space is shaped to minimize 
the required power; i.e., preferably the region is an approximation to the Voronoi region of the trellis code (where 
the trellis code is not just a repeated lattice code). 
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Mapping Digital Data Sequences 

This invention relates to modulation systems for sending digital data via a channel, and in particular to a 
method of mapping digital data, sequences into signal point sequences for data transmission. 

In traditional uncoded modulation systems, to send n bits per N signaling dimensions, a 2 n -point N- 
dimensional signal constellation is used. Each group of n bits is independently mapped into one of the 2 n 

s signal points. The selected signal point is then transmitted over a channel, e.g., by N uses of a pulse 
amplitude modulation (PAM) transmitter, or by N/2 uses of a quadrature amplitude modulation (QAM) 
transmitter. The signal points in the constellation may be selected and organized to achieve a good signal 
to noise ratio (SNR) efficiency. The key parameters that determine SNR efficiency are the minimum 
squared distance between the constellation signal points and their average power. 

10 In coded modulation systems, by contrast, the digital data are encoded as a sequence drawn from a set 
of allowable sequences of signal points. The allowable sequences are selected in such a way that the 
minimum squared distance between allowable sequences in the code is greater than the minimum squared 
distance between the components of the sequences, namely the signal points. This requires that not all 
possible sequences be allowable, which in turn means that the number of signal points in the constellation 

is must be expanded. Expansion of the constellation leads to some increase in the average power required to 
send n bits per N dimensions, but the increase in minimum squared sequence distance outweighs the 
increase in average power and a net 'coding gain' is achieved. 

There are two basic types of coded modulation systems block and trellis. 

In block coded systems, the code sequences are finite length blocks of signal points and are often 
20 called code words. Within a given block, the signal points depend on one another, but there is no 

interdependence between signal points of different blocks. Lattice codes (in which a code word is a point on 

a translate of a multidimensional lattice) may be regarded as block coded systems, if the code words are 

construed as defining sequences of signal points. 

In trellis-coded modulation (Ungerboeck, "Channel Coding with Multilevel/Phase Signals," IEEE Trans- 
25 actions on Information Theory, Vol. IT-28, No. 1, pp. 56-67, January, 1982), on the other hand, code 

sequences of signal points are in principle infinitely long, and there are signal point interdependencies that 

extend over the whole sequence. 

Typically, coded modulation systems have been designed jointly with an associated signal constellation 

or family of constellations. The constellation itself generally has been of the block type in which the subset 
30 of allowable sequences has been the subset of all sequences in the code whose N-dimensional signal 

points lay in some finite (e.g., 2 n + r -point) N-dimensional signal constellation. Thus, the constellation was the 

same in each N dimensions and not dependent on signal points outside those N dimensions. 

It has, been recognized that such constellations can achieve improved SNR efficiency if they are 

shaped to be as nearly spherical as possible. 
35 Our British Patent specification GB - A - 2207583 describes signal constellations which comprise points 

of a lattice (or a coset of a lattice) that lie within a so-called Voronoi region of a sublattice of original lattice. 

A Voronoi region of a lattice is the set of points that are as close to the origin as to any other lattice point. 

Thus, the use of a Voronoi region of a lattice to define the constellation boundary achieves the advantages 

of a quasi-spherical constellation. Such Voronoi constellations are also discussed in Conway and Sloane, "A 
40 Fast Encoding Method for Lattice Codes and Quantizers," IEEE Trans. Inform. Theory, Vol. IT-29, pp. 820- 

824, 1983, and Forney, United States Patent Application Serial Number 181,203, filed April 13, 1988 (which 

uses some points of the Voronoi constellation to support a secondary channel). 

We describe in detail below a so-called trellis shaping modulation system that achieves improved 

performance. 

45 Instead of mapping a digital data sequence into a sequence of signal points drawn from a signal point 
constellation as described above, the data sequence is mapped into a sequence space of signal points 
drawn from a so-called sequence space of available sequences that is not merely the Cartesian product of 
a single N-dimensional constellation used repeatedly in each N dimensions. The region occupied by the 
available sequences in sequence space is shaped to minimize the required power, e.g., the region may 

so consist of a fundamental region of a trellis code, and more particularly, an approximation to the Voronoi 
region of a trellis code (where the trellis code is not just a repeated lattice code). 

In general, in one aspect of the invention, a digital data sequence is mapped into a signal point 
sequence for data transmission by selecting the signal point sequence from a subset of all possible signal 
point sequences based on the digital data sequence, where all possible signal point sequences in the 
subset lie in a fundamental region of a trellis code, the fundamental region being other than a simple 
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Cartesian product of finite-dimensionai regions. 

Preferred embodiments of the invention include the following features: 

The possible signal point sequences in the subset are code sequences from a translate of a second 
code of which the trellis code is a subcode, the second code being of the trellis or lattice type. The 

s fundamental region is approximately a Voronoi region of the trellis code, and comprises the set of the 
possible signal point sequences that are decoded to the zero sequence in the trellis code by an 
approximation to a minimum distance decoder, e.g. a minimum-distance decoder with finite delay M, where 
M is greater than or equal to one. More broadly, the fundamental region may be the set of the possible 
signal point sequences that are decoded to a common error region of any decoder for the trellis code. 

to The step of selecting the signal point sequence includes mapping the digital data sequence into an 
initial signal point sequence belonging to and representing a congruence class of the trellis code, and then 
choosing a signal point sequence which belongs to the congruence class and which has smaller or equal 
average power compared to that of than the initial signal point sequence. In some embodiments, the 
mapping step includes mapping the digital data sequence into signal points belonging to an initial 

75 constellation comprising points lying within a fundamental region opf a time-zero lattice of the trellis code. In 
other embodiments, the mapping includes applying a portion of the elements of the digital data sequence to 
a coset representative generator to form a larger number of digital elements representing a coset 
representative sequence. 

In the latter embodiments, the digital data sequence may be recovered from a possibly noise-corrupted 

20 version of the signal point sequence by decoding the signal point sequence to a sequence of estimated 
digital elements, and then forming a syndrome of fewer digital elements based on some portion of the 
estimated digital elements using a feedback-free syndrome former, the syndrome former being anMnverse 
of the coset representative generator. 

In some embodiments, the signal points in said sequence belong to a 2D constellation and the step of 

25 selecting said signal point sequence is constrained so that the signal points in said sequence will be within 
some radius R of the origin of said 2D constellation. In particular, the step of choosing a signal point 
sequence belonging to the congruence class comprises decoding the initial signal point sequence into a 
final signal point sequence comprising signal points belonging to a 2D constellation, the decoding being 
constrained so that only final signal point sequences whose signal points have magnitudes no greater than 

30 some predetermined radius R from the origin of the constellation are used. 

In some embodiments, the step of choosing a signal point sequence belonging to the congruence class 
comprises (a) decoding the initial signal point sequence into a code sequence using a Viterbi algorithm, and 
(b) in each recursion of the Viterbi algorithm, effecting an operation that will assure that the decoded 
sequence is an allowable sequence in the second code. The operation comprises adjusting the metrics of 

35 selected historical paths in the trellis of the Viterbi algorithm, so that none of the selected paths will become 
the most likely path in the next recursion of the Viterbi algorithm. The historical paths are chosen based on 
whether they include particular state transitions at particular locations in the trellis of the Viterbi algorithm. 
The operation includes assigning a large metric to selected historical paths in the trellis. 

In some embodiments, the step of mapping the digital data sequence into a sequence of signal points 

40 belonging to an initial constellation includes converting the data elements in the data sequence into groups 
of bits for selecting signal points from the initial constellation, and the groups of bits are differentially 
encoded to assure that the bits can be recovered from a channel-affected version of the transmitted 
sequence which has been subjected to phase rotations of one, two, or three times 90 degrees. 

In another aspect of the invention, the digital data sequence is mapped into signal points by a 

45 technique which includes determining a class of possible sequences from which to select the sequence to 
be sent, based on digital data appearing in the digital data sequence up to a time j; and, for every possible 
time j, selecting the sequence to be sent from the class of possible sequences based on digital data 
appearing in the sequence after time j. 

In general, in another aspect the invention features selecting the sequence of signal points to be sent 

so from a class of possible sequences specified by digital data in a digital data sequence, the selection being 
based on the average power of the different possible sequences of the class, the selection being based not 
only on a finite block of the digital data. 

Preferred embodiments include the following features: 

Each data element (a binary b-bit value) in the data sequence is mapped into a point in an initial 
55 constellation of 2 b points. The resulting sequence of points from the initial constellation comprise a 
representative sequence of the class of possible sequences. The mapping of data elements to points in the 
initial constellation is linear or distance invariant. The mapping of each element is done independently of the 
mapping of any other element. 
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The sequence of points in the initial constellation is decoded by maximum likelihood sequence 
estimation in accordance with a trellis code. A final decision on decoding for a given point is delayed until at 
least one interval following time j. The fundamental region is a common error region of the decoder, and the 
sequence of points in the initial constellation is decoded to an error sequence within the common error 
5 region. The signal points in sequences within the common error region comprise a final constellation which 
is larger than the initial constellation. 

The shaping gain achieved by the above methods may be augmented by a coding gain. 

At the receiver, the digital data sequence is recovered by decoding the sequence of sent signal points 
to recover the sequence of points of the initial constellation, and then inverse mapping the sequence of 
70 points of the initial constellation to recover the digital data sequence. The decoding of the sequence of sent 
signal points is done on each point independently of the other points. 

Where a trellis code it used it may be a linear trellis code, e.g., a 4-state Ungerboeck code, or a dual 
Wei code. The trellis code may be based on a partition of binary lattices, or on a partition of ternary or 
quaternary lattices. 

75 The invention also features a modem that incorporates certain of the foregoing aspects. 
Our systems achieve large total gains with relatively low complexity. 

Other advantages and features will become apparent from the following description of the preferred 
embodiment, by way of example only with reference to the accompanying drawings in which:- 
Fig. 1 is a block diagram of an encoder for a trellis code; 
20 Fig. 2 is a block diagram of a mapping from one fundamental region of a trellis code to another; 

Fig. 3 is a block diagram or mappings from a simple fundamental region to a Voronoi region and vice 

versa; 

Fig. 4 is a block diagram of a map from b-bit words to a transmitted sequence e, and vice versa; 
Fig. 5 is a block diagram of a map using an Ungerboeck 4-state, 2D code; 
25 Fig. 5A is a chart illustrating lattice partitioning for the map of Fig. 5; 

Fig. 5B is a trellis diagram corresponding to Fig. 5, 

Fig. 6 is a block diagram of the Ungerboeck 4-state 2D mapping code combined with a coding 
operation based on a scaled version of the Ungerboeck code; 

Fig. 7 is a block diagram of a receiver for the sequence e of Fig. 6; 
30 Fig. 8 is a block diagram of a modulation system combining coding and shaping; 

Fig. 9 is a block diagram of an encoder, syndrome former, and right inverse for the code used in the 
Ungerboeck 4-state, 2D trellis; 

Fig. 10 is a block diagram illustrating the generation of a trellis constellation based on a mod-2 trellis 

code; 

35 Fig. 11 is a block diagram of a receiver for the Fig. 10 constellation; 

Fig. 12 is a block diagram of a general construction for a modulation system with both coding and 
shaping; 

Fig. 13 is a Wei code embodiment of Fig. 12; 

Figs. 14 and 15 are matrices associated with 8-state and 16-state 4D Wei codes, respectively; 
40 Fig. 16 is a graph of shape gain versus normalized informativity for trellis constellations; 

Fig. 17 is a block diagram of modem transmitter circuitry; 

Fig. 18 is a block diagram of the binary encoder or the circuitry of Fig. 17; 

Fig. 19 is a block diagram of the 16-state binary Wei convolutional encoder of Fig. 18; 

Fig. 20 is a block diagram of the syndrome former of Fig. 18; 
45 Fig. 21 is a constellation for Fig. 18; 

Fig. 22 is a 2D constellation for Fig. 18; 

Fig. 23 is a block diagram illustrating the operation of the decoder for the shaping code; 
Fig. 24 is a trellis diagram for the 8-state 4D dual Wei code; 
Fig. 25 is a block diagram of modem receiver circuitry; 
so Fig. 26 is a block diagram of the binary decoder of Fig. 25; 

Fig. 27 is a block diagram of the syndrome former of Fig. 26; 
Fig. 28 illustrates quadrant partitioning; 

Fig. 29 is a block diagram of the constellation former of Fig. 18 ; and 
Fig. 30 illustrates a quadrant labeling scheme. 

55 

Terminology and Principles 
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We first discuss some of the terminology and principles which underlie the invention. 



Lattices 

5 

An N-dimensional real lattice A is a discrete set of N-tuples (or points), i.e., elements of real Euclidean 
N-space R N (assumed to span R N ), that has the group property: the sum or difference of any two lattice 
points under vector addition is another lattice point. Two elements of R N (whether or not they are points of , 
the lattice) are said to be congruent mod A if their difference is a point in A. The set of all elements 
70 congruent to a given N-tupie a is called the coset (or congruence class, or translate) A + a of A. 

Geometrically, a lattice A is characterized by the minimum squared distance d min (A.) between its 
points, and by its fundamental volume V(A), which is the volume of N-space associated with each lattice 
point. 

75 

Fundamental Regions of Lattices 



A fundamental region R(A) of A is defined as a region of N-space that contains one and only point from 
20 each congruence class mo"d A. The volume of any fundamental region of A is V(A). 

The Voronoi region R V (A) of A is the set of all elements in R N that are at least as close to the -origin as 
to any other point in A. The Voronoi region R V (A) is the. closure of a fundamental region of A and has 
volume V(A). 

25 

Partitions of Lattices 



A sublattice a' of A is a subset of the points in A that itself is a lattice. We say also that A is a 
superlattice of a'. Since a' is a subgroup of A, A is the union of some number |A/a'| of cosets (congruence 
classes) of a'; thus a' induces a |A/A|-way partition of A. The partition is denoted as A/A . (More generally, 
any translate A + a of A is the union of |A/a'| cosets of A.) The number |A/A | is called the order of the 
partition A/A (or the index of a' in A). A set of coset representatives for the |A/A | cosets of A in A is any 
set of |a/|a'| N-tuples that includes one from each coset, and is denoted by [A/A ]. f 

If a' is sublattice of A, then the fundamental volume V(a') of A is equal to |A/A |V(A). 



Binary Lattices 



A binary lattice A is an integer lattice (a sublattice of the N-dimensional lattice Z N of integer N-tuptes) 
that has 2 n 2 N as a sublattice, so that Z N /A/2 n Z N is a lattice partition chain. The least such n is called the 2- 
depth of the lattice A, and A is then called a mod-2 n lattice. 

A mod-2 lattice A is thus an N-dimensional integer lattice that has 2Z N as a sublattice. Every mod-2 
lattice may be characterized as the set of all integer N-tuples that are congruent mod 2 to a codeword in 
some linear binary block code C of length N. If C is an (N,K) code, then the orders of the partitions Z /A 
and A/2Z N are |Z N /A| = 2 N ' K and |A/2Z N | = 2 K , respectively, and the 2 K codewords of C may be regarded as 
a set [A/2Z N ] of coset representatives for the |A/2Z N |-way partition A/2Z N , while any set of coset representa- 
tives [(N,N)/C] for the cosets of C in the set (N t N) of ail binary N-tuples may be taken (as integer N-tuples) 
as a set of coset representatives [Z N /A] for the cosets of A in Z N . We denote such a mod-2 lattice by the 
notation A( NiK ). 



Constituent 2D Lattices, Redundancy, Informativity 

55 



The constituent 2D lattice A 2 of an N-dimensional lattice A is the projection of A onto two dimensions 
(assumed to be the same for any choice of two of the N dimensions). Normally we scale integer lattices so 
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that A2 is Z 2 , the two-dimensional lattice of integer 2-tuples. If N is even and As is the same for each pair of 
dimensions, then A is a sublattice of (A2) N/2 . The redundancy r(A) of A is then defined so that 2 r{A ) = |(A2>- 
KW2 /A|, or |Z N /A| if A2 = Z 2 . The normalized redundancy p(A) of A (per two dimensions) is defined as p(A) = 
2r(A)/N. Thus the redundancy of a mod-2 lattice A {N ,K) is N-K, and its normalized redundancy is 2(1 -K/N). 

5 The constituent 2D sublattice A2 of an N-dimensional lattice A is the set of all elements of A whose 
components are all zero except in two specified dimensions, projected onto those two dimensions 
(assumed to be the same for any choice of two dimensions). If N is even and A2' is the same for each pair 
of dimensions, then (A 2 ) N/2 is a sublattice of A. 

For binary lattices, the constituent 2D sublattice A2' is some lattice in the infinite chain Z 2 /RZVR 2 Z?/... of 

10 versions R'Z 2 of the lattice Z 2 , where R is the two-dimensional norm-doubling rotation operator specified by 
the 2x2 matrix R = {[1,1], [-1,1]}, so that the order of each partition RJZ 2 /Rj + 1 Z 2 is 2. Note that R 2 Z2 = 2Z 2 . 
The depthu of a binary lattice A is the least integer is u such that R U Z 2 is a sublattice of A. The 
informativity k(A) of A is then defined so that 2 k(iV > = |A/(A 2 ') N/2 |, or |A/R U Z N | if A 2 ' N/2 = R U Z 2 , where R U Z N 
means (R U Z 2 ) N/2 . The normalized informativity *(A) of A (per two dimensions) is defined as x(A) = 2k(A)/N. 

is Thus the informativity of a mod-2 (and depth-2) lattice A (N|K ) is K, and its normalized informativity is 2K/N. 



Lattice-type Trellis Codes 

20 

Sequences 



A sequence x = (xo, xi,...) of elements Xj from an alphabet A is defined to start at time j = 0. The set of 
25 all possible such sequences will be denoted as A°°. Thus the set of all sequences whose elements belong 
to a lattice A is denoted as A°°. For any lattice A the set A 00 has the group property under sequence 
addition and is thus an infinite-dimensional lattice. 



30 Trellis Codes 



A lattice-type trellis code C (hereafter, simply a trellis code) is defined by a lattice partition A/a' and a 
convolutional encoder E (or a convolutional code C). Hereafter, A and a' will always be taken to be binary 
35 lattices; then the order |a/a'| of the partition A/a' is a power of two, say 2 k + r . Thus each of the 2 k+r cosets 
of A in A may be identified by a (k + r)-bit label. 

Referring to Fig. 1, encoder E 10 is a rate-k/(k + r) binary convolutional encoder; i.e., a finite-state 
machine whose input sequence x = (xo, xi,...) is a sequence of binary k-tuples Xj 12, and whose output 
sequence y = (y 0 , yi, ...) is a sequence of binary (k + r)-tuples y, 14, such that the input-output map is 
40 linear (under addition mod 2) and time-invariant. The encoder E starts in a definite state, called the zero 
state, and remains in the zero state as long as the inputs Xj are all zero, during which time the outputs yj 
also remain all zero. The output (k + r)-tuples yj select a sequence of cosets 1 6 of a' in A (for the partition 
1 8 A/A ) according to some labeling scheme. 

The trellis code C(A/A ;C) then consists of all permissible sequences c of elements of A that fall in a 
45 sequence of cosets of a' that could be selected by the encoder E; i.e., that have labels that are in code 
sequences in the convolutional code C. Thus C(A/a';C) is in general a proper subset of the lattice A°° f 
because some sequences in A 00 are not permitted in the trellis code C. There is a trivial case in which C is 
A 00 , which exhibits A°° as a degenerate 'one-state' linear trellis code. ~ ~~ 
(Conventional codes often use signal points from a translate A + a of A. We regard the sequences in 
50 such a code as translates C(A/A';C) + a°° of the code sequences in C(a/a';C), whose elements are 
constrained to be in A itself.) 

The redundancy r(C) of a trellis code C(A/A ;C) is defined as the sum of the redundancies of the lattice 
A and the code C, r(C) = r(A) + r, and its normalized redundancy is defined as p(C) = 2r(C)/N. Its 
informativity is k(C) = k(A) + k, and its normalized informativity is *(C) = 2k(C)/N. Since^/A/A'/R^Z" is a 
55 partition chain with orders |Z N /A| = 2 r<A >, |AM| = 2 k+r , and |a'/R u Z n | = 2 k(A ), where u is the depth of a', 
and |Z N /R U Z N | = 2 uN/a , we have r(C) + k(C) = uN/2. 
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Linear Trellis Codes 

In a linear trellis code C(A/a';C), the sum of any two code sequences is a code sequence. 

A labeling of the cosets of a' in A is called linear if the label of the sum of two cosets of A is the sum 
(mod 2) of their (k + r)-bit labels, where the sum of two cosets of a', say A+a and A +b, is defined as the 
coset a' + (a + b) that contains all sums of elements of the two cosets. 

If the zero label corresponds to the zero coset of a' (a' itself), as is always true with a linear labeling, 
then the set of all sequences in C(A/a';C) selected by the code sequence of zero valued elements in C. is 
(a') 00 , so (a')°° is a subset of C(A/a';C). Thus if C(A/A;C) is a linear trellis code, then A°°/C(A/a';C)/(A )°° is 
an infinite-dimensional lattice partition chain. 

In the special case where both A and a' are mod-2 lattices, say~A (NiK ) and A {NiK '), then it is always 
possible to choose the labeling so that C(A< n , k >/ a <n.k');C) is a linear trellis code. For then the (N,K ) code 
must be a subcode of the (N,K) code, and the 2 K ' K cosets of A< N(K ') in A (N)K) are isomorphic to the cosets of 
the (N,K') code in the (N,K) code. There is a linear map (mod 2) from binary (K-K)-tuples to coset 
representatives of the (N,K) code in the (N,K) code, which may also be taken as coset representatives of 
the cosets of A (N k') in A< N)K ). If the labeling is such a linear map, then C^n.k/A^.k'); C) is a linear trellis 
code. In this case we call C a mod-2 trellis code, and (Z n ) co /A 00 /C(A/a';C)/(A / ) 00 /(2Z n ) 00 is a lattice partition 
chain. 

If C is a mod-2 trellis code, then it can always be represented as a code C(Z N /2Z N ;C ) based on the 2 - 
way partition Z N /2Z N and an augmented rate-k(C)/[k(C) + r(C)] convolutional code C', where k(C) + r(C) = N. 
With an appropriate labeling, C may then be described as the set of all integer N-tuple sequences that are 
congruent mod 2 to binary N-tuple code sequences in the code c'. For simplicity, we shall hereafter 
consider all mod-2 trellis codes to be in this form. We say that the trellis code C is generated by the rate-k- 
(C)/N convolutional code c\ 



The Time-Zero Code and Lattice 

For any rate-k/(k + r) convolutional encoder E, the time-zero code Co may be defined as the set of all 2 k 
output (k + r)-tuples y 0 that could be selected at time j=0 by the 2 k possible time-zero input k-tuples x 0 . By 
linearity, Co is a (k + r, k) linear binary block code (assuming that all such y 0 are distinct). 

Correspondingly, the time-zero lattice A 0 of C is defined as the union of the 2 k corresponding cosets of 
a'. (The definition of C given above is not sufficient to require this union to be a lattice, but in practice it 
always is.) Thus the time-zero lattice Ao is a superlattice of a' of order 2 k , and a sublattice of A; since |A/A | 
= 2 k + r , it follows that |a/a' 0 | = 2 r . 

If C(Z N /2Z N ; C) is a mod-2 trellis code based on the partition Z N /2Z N and a rate-k/N code C, then its 
time-zero lattice is a mod-2 lattice A 0 = A (Nik) comprising all integer N-tuples that are congruent mod 2 to a 
codeword in the time-zero code Co, which is an (N,k) binary block code. 

The time-zero code represents the set of possible output (k + r)-tuples y 0 = y(x 0 ) when the encoder 
state is zero. For any encoder state Sj, if y(0, Sj) is the encoder output when Xj = 0, then by linearity and 
time-invariance the output is y(x jt Sj) = y(xj) + y(0, Sj) if the encoder input is x i( so that the set of possible 
output (k + r)-tuples is a coset of the time-zero code Co. Correspondingly, in any practical coset code C, the 
set of possible signal points corresponding to any encoder state Sj is a union of 2 k cosets of A in A that 
forms a coset of the time-zero lattice Ao in A. 

If C(Z N /2Z N ; C) is a mod-2 trellis code with a time-zero code Co and time-zero lattice Ao - A (Nik ), then a 
coset of Ao is the set of all integer N-tuples that are congruent mod 2 to any word in a given coset of Co, 
and we may take any set t(N,N)/C 0 ] of coset representatives for the cosets of Co as a set [Z N /A 0 ] of coset 
representatives for the cosets of Ao in Z N . 



Fundamental Regions of Linear Trellis Codes 

For a linear trellis code C (an infinite-dimensional lattice), if R(C) is a fundamental region of C, then the 
set of translates of R(C) by c as c ranges through all code sequences in C is an exhaustive partition of 
sequence space (Ft*)™, as will be shown below. Many good trellis codes, such as the 4-state 2D 
Ungerboeck code (Ungerboeck, "Channel Coding with Multilevel/Phase Signals," cited above) and most of 
the Wei codes, Wei ("Trellis-coded Modulation with Multidimensional Constellations, IEEE, Trans , Inf. 
Theory, Vol. IT-33, pp. 177-195, 1987) are based on partitions of mod-2 lattices with linear labelings, and 
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thus are linear. 

Other known lattice-type trellis codes, such as the remaining Ungerboeck codes and most of the 
Calderbank-SIoane codes, (Calderbank and Sloane, "New Trellis Codes Based on Lattices and Cosets," 
IEEE Trans. Inf. Theory, vol. 25, no. 1, pp. 12-21, 1987), are based on partitions A/a' in which a' is a mod-4 

5 binary lattice - The labeiings used with these codes are not linear, but satisfy a weaker condition called 
•regularity* which assures that the distribution of distances from a given code sequence c to all other code 
sequences in C is independent of c. When C has such a distance-invariance property, we expect that 
sequence space may be exhaustively partitioned into regions that are geometrically similar, each centered 
on one of the code sequences c in C, so that we may take the region centered on the zero sequence as a 

10 representative fundamental region. 



Decoders and Fundamental Regions for Linear Trellis Codes 



75 An exhaustive decoder of a trellis code C is any map c(r) from arbitrary sequences r in sequence space 
(R N )°° to sequences c in C. The decision region R(c) associated with any particular code sequence c is the 
set of ail r that map to c.~"the apparent error, or simply error, associated with r is then e(r) = r - c(r), and 
the error region associated with c (the set of apparent errors e(r) for all r such that c(r) = c) is then R e (c) = 
R(c) - c. If we assume that every decision region R(c) includes c, then every error region R e (c) includes the 

20 zero sequence 0. The set of all decision regions R(c) for all c in C forms an exhaustive partition of 
sequence space (R N )°°. 

A fair, exhaustive decoder of a linear trellis code C is a map such that if r maps to c, then r + c maps to 
c + c' for ail c' in C. It follows that if the decision region corresponding to the code sequence 0 is R(0), then 
the decision region corresponding to any code sequence c is R(c) = R(0) + c, and there is a common 

25 error region R e (c) = R e (0) = R(0) for all c in C. Thus there is an exhaustive partition of sequence space 
(R N )°° consisting of the" translates R(0) + c of the common error region R(0) for all c in C. 

If C is a linear trellis code, Two sequences in (R) N )°° will be said to be congruent mod C if their 
difference is an element of C. Since C is a group under sequence addition, congruence mod C is an 
equivalence relation and partitions sequence space (R N )°° into equivalence (congruence) classes. A 

30 fundamental region R(C) of a linear trellis code C is then a set of sequences that includes one and only one 
sequence from each" congruence class mod C (i.e., a set of coset representatives for the cosets of C in 
(R N )°°. Then every sequence r in (R N )°° can be uniquely expressed as a sum r = c + e for some c in C 
and e in R(C); i.e., there is a coset decomposition which may be written as (R N )°° = C + R(C). 

ClearTyrR(C) is a fundamental region of a linear trellis code C if and only if it is the common error 

35 region R(0) of "a fair, exhaustive decoder for C. Thus we may specify fundamental regions by specifying 
appropriate decoders. 

Simple Fundamental Regions of a Linear Trellis Code 



A simple fundamental region of a linear trellis code C may be specified by a 'hard-decision decoder* for 
C as follows. 

~ Let Ro be a simple fundamental region of the time-zero lattice A 0 . Then, for any coset A 0 +a of A 0 
every point r in R N may be uniquely written asr = (c + a) + e, where (c + a) is an element of Ao +a and e is 
45 an element of R 0 . This specifies a fair, exhaustive decoder for the translate A 0 + a of A 0 that has the 
common error region Ro. 

At time j = 0, the hard-decision decoder receives the first element r 0 of r, and uses the above decoder 
for Ao to make a 'hard decision' on the first element c 0 of c. This determines an estimated state si at time 
j = 1 of the encoder E, which in turn determines the set of possible next elements ci, a coset Ao +a(Si) of 

so the time-zero lattice Ao - Then, given the next element n of r, the hard-decision decoder may use the above 
decoder for Ao + a(si) to make a hard decision on the next element ci of c r and so forth. 

For any c, this hard-decision decoder has a common error region equal to (Ro)°°. the set of all 
sequences of elements of Ro- It is therefore a fair, exhaustive decoder (albeit not a very good one), and 
(R 0 )°° is thus a simple fundamental region of the linear trellis code C. 

55 ~ The fundamental region (Ro)°° is a Cartesian product of fundamental regions Ro, and clearly may be 
regarded as having a volume~bf V(R 0 ) = V(A 0 ) per N dimensions. Any other fundamental region of C is 
congruent to (Ro) 00 mod C, and therefore may be regarded as having the same volume. Thus the 
fundamental volume of a trellis code C with time-zero lattice Ao is V(A 0 ) per N dimensions. 
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The Voronoi Region of a Linear Trellis Code 

For an arbitrary set of sequences, there is some conceptual difficulty in specifying a minimum-distance 
sequence decoder, because the squared error (squared Euclidean distance) Hell 2 between a given sequence 

5 r and every sequence in the set will usually be infinite. However, when the set of sequences is a trellis code 
C generated by a finite-state encoder E, a minimum-distance decoder may be operationally specified as the 
code sequence c(r) generated by a Viterbi algorithm decoder for C with unlimited memory, when the 
decoder input is the sequence r. The Voronoi region R V (C) of a trellis code C may then be defined as the 
set of all r in (R N )°° that map to the code sequence 0; i.e., as the decision region R(0) of a minimum- 

w distance Viterbi algorithm decoder. If C is a linear trellis code, then this is the common error region R(0) of 
such a decoder for any code sequence c. 

Issues such as what the decoder does when two paths of the trellis converge at some state with the 
same metric (squared distance), or when paths do not converge in their initial segments for an arbitrarily 
long time, basically involve what the decoder does in the case of ties, when the sequence r lies on the 

75 boundary between decision regions corresponding to different code sequences. It is clear that, as with 
Voronoi regions of lattices, these questions can be largely avoided by defining the Voronoi region R V (C) to 
include all of its boundary points. Since the boundary of the Voronoi region is a region of measure zero in 
sequence space, it usually will not matter how boundary ties are resolved. In constellation design, however, 
it may be necessary to pay special attention to boundary points and ties. 

20 

Finite-Memory Viterbi Decoder 



25 A practical Viterbi algorithm has a finite decoding delay M. Such a decoder makes a hard decision on 
the first component c 0 of c, and thus on the state Si of the encoder, on the basis of the first M + 1 
components (r 0 , n, r M ) of r. Then, given the estimated state s^ , it makes a hard decision on ci based on 
(ri, r 2 , .... r M+1 ), and so forth. Such a decoder is suboptimum, but as M goes to infinity its performance 
approaches that of a Viterbi decoder with unlimited memory. Note also that for M = 0, such a decoder 

30 reduces to the hard-decision decoder described above, with R 0 being the Voronoi region of the time-zero 
lattice A 0 . Thus there is a sequence of decision-feedback' Viterbi decoders with increasing decoding delay 
M, starting with the optimum hard-decision decoder (zero delay) and approaching the optimum minimum- 
distance sequence estimator (infinite delay). It is clear that performance must monotonically increase with 
M. As with other applications of maximum likelihood sequence estimation (e.g., convolutional codes, 

35 intersymbol interference), the performance of such finite-memory decision-feedback decoders can be 
analyzed using variants of techniques applicable to blocks of length M + 1, so that infinite-dimensional trellis 
codes can be analyzed by variants of techniques applicable to finite-dimensional lattices. 
Later, we shall need the following lemma: t , ^ 

Lemma: The Voronoi region R V (C) of a trellis code C(A/A ;C) is contained in [R V (A )] , the set of all 

40 sequences of elements of the Voronoi region R V (A ) of A . 

The proof is that because C is a union of cosets of (A )~ in principle one could realize a minimum- 
distance decoder for C by decoding each of the cosets of (a') 00 separately, and then choosing the closest 
of the resulting sequences. The common error region for a minimum-distance decoder of any coset of (A ) 
is [R v (A / )] O0 1 so the final error sequence must be contained in [Rv(A')f. Note that this proof does not 

45 depend on the linearity of C and thus is valid for general trellis codes. 

An immediate corollary is that if N is even and A 2 ' is the constituent 2D sublattice of A , then R V (A ) is 
contained in [Rv(A 2 ')] N/2 (since (A 2 ') m is a sublattice of a'), and therefore the Voronoi region of C is 
contained in ([R v (A 2 ')] N/2 )°°. If a' is a binary lattice of depth u, therefore, the Voronoi region of C is 
contained in [Rv^Z 2 )] 00 , an infinite Cartesian product whose constituents are two-dimensional squares of 

so area 2 U , centered on the origin. 



Mapping Between Fundamental Regions 

Referring to Fig. 2, any fundamental region R 20 of C 22 may be mapped into any other fundamental 
region r' 24 by the map that takes a point of R to the unique point of R in the same congruence class 
mod C"Operationally, this map may be performed by a fair, exhaustive decoder 2C for C with common 
error region r\ with the input to the decoder being a point r in R, and the output being the error e = r-c, 
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which must be the unique point that lies in the common error region r' and is congruent to r mod C. The 
map may of course be inverted by a decoder with common error regiorTR. ~" 

Referring to Fig. 3 t for example, we may take r to be a sequence~from a simple fundamental region 
(Ro)°° 30, and use a Viterbi decoder 32 to map r to a sequence e in the Voronoi region Rv<C) 34 of C that is 
5 congruent to r; to invert this map, we use a simple hard-decision decoder 36, using aldecoder for~Ao with 
common error region R 0 , to recover the sequence r from e. 



Shape Gain 

10 

The shape gain of a fundamental region of sequence space that is defined by a trellis code C and an 
associated decoder will be defined as follows. ~" 

Let Ro be any fundamental region of the time-zero lattice Ao of C, with volume V(A 0 ). Define a uniform 

75 continuous probability distribution over Ro, with probability density p(r) = 1/V(A 0 ), and let this distribution 
define a corresponding uniform distribution over sequences r in (Ro)°°. Finally, define a uniform distribution 
over the sequences e in the fundamental region R = R(0), the common error region of the decoder, as the 
distribution induced by the decoder map from r to e. We assume that the mean P(R) of the norm llejll 2 of 
the N-dimensional components ej of e is well defined under this uniform distribution over R, at least as 

20 (A practical method of estimating P(R) is by Monte Carlo simulation, using a pseudo-random uniform 
distribution over a discrete but dense set of points uniformly distributed across Ro, and a decoder that 
maps a sequence of these points into the corresponding sequence in R = R(0).) "~ 

The normalized second moment of R is then defined as G(R) = [P(R)/N]/[V(R 0 )] 2/N , where P(R)/N is the 
average power per dimension, and [N^Ro)] 27 * is a normalizing "factor that scales" as the square erf a linear 

25 dimension, like P(R). If R 0 is an N-cube of side R and R = (Ro)°°, then P(R) = NR 2 /12 and V(R 0 ) = R N , so 
G(R) = 1/12, which we define as the baseline normalized second moment Go = 1/12. The shape gain 7 S (R) 
is then defined as the ratio -y s (R) = G 0 /G(R), or 7s (R) = 1/[12G(R)]. " 

For a hard-decision decoder based on a fundamental region R 0 of the time-zero lattice Ao of C, the 
normalized second moment and shape gain reduce to the corresponding parameters G(Ro) and y s (Ro) of 

30 the N-dimensional region Ro. In particular, since a Viterbi algorithm with delay zero is" a hard-de"cision 
decoder with R 0 = R v (Ao), the Voronoi region of Ao, the shape gain of a trellis code C with such a decoder 
is the same as the shape gain 7s(Ao) of the Voronoi region of the time-zero lattice Ao~of C. Since a Viterbi 
decoder with delay M cannot perform worse than one with less delay, the average power P(R) must 
decrease monotonicaily with M, and therefore the shape gain increases monotonically from 7 S (Ao)"with M. 

35 We assume that this sequence of shape gains approaches a limit, and identify this limit as the shape gain 
7 S (C) of the trellis code C. 



Constellation Expansion Ratio 

40 

We take as an initial constellation a set of N-dimensional points from the region Ro, whose volume is V- 
(Ro) = V(Ao). If we use a minimum-distance sequence decoder, then the common error region R = R(0) 
will be the Voronoi region of C. By the Lemma, this region is contained in [Rv(a')]°°, so~the Fnal 

45 constellation is a set of N-dimensional points from the region R v (a'), whose volume is" V(a'). Assuming that 
the final constellation fills R v (a'), the final constellation occupies a volume which is a factor of V(a')/V(Ao) = 
|A 0 /A | = 2 k times as large as that occupied by the initial constellation in N-space R N , where k is the 
informativity of the convolutional code C. We may therefore define 2 k as the shaping constellation 
expansion ratio in N dimensions. 

so In two dimensions, the final constellation is contained within R V (A2 ), where A2' is the constituent 2D 
sublattice of a'. The Cartesian product (A 2 ') N/2 has a fundamental volume which is VKA^^yViA) = 
[A /(A2 ) N/2 | times as large as that of a' in N-space, or 2 k{ ^ ) times as large, if a' is a binary lattice of 
informativity k(A'). Thus V^Aa')" 72 ] is a total of 2 k< £ > times as great as V(Ao) in N-space where k(C) = k + 
k(A ) is the informativity of the code C. Normalized to two dimensions, the constellation expansion ratio is 

55 [2 k{ Q)f^ = 2*< C ) t where x(C) = 2k(C)/N is the normalized informativity of the code C. We denote this 
constellation expansion ratio by CER s (Cf= 2*( S. ) since it is due to the use of C for constellation shaping. 
(By contrast, when C is used for coding, the associated coding constellation expansion ratio is CER C (C) = 
2P< C ).) ~ "** 
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Construction of Trellis Constellations 

A linear trellis code can be used in the construction of so-called trellis constellations, which we also call 
'trellis shaping. 1 We denote the trellis code by C S (A S /A S '; C s ), where the subscripts stand for 'shaping 1 (to 

5 distinguish, as explained below, from the codes C c (Ac/A c '; C c ) to be used for coding), where C s is now a 
rate-k s /(k s + r s ) code. For large numbers of bits per symbol, we can achieve the shape gain 7s (C s ) of C s , at 
the cost of a shaping constellation expansion ratio of CER S (C S ). 

(We assume without loss of generality that both C s and C c are based upon lattice partitions in the same 
number of dimensions, N. For if A s and A s ' are N-dimensional and A c and A c ' are M-dimensionai, then we 

70 can use the Cartesian product codes ^ and C£ , which are based upon the M N-dimensional partitions (A s )- 
M /(A S ') M and (A C ) N /(A C ') N , respectively, to arrive at shaping and coding codes of the same dimension.) 

Let A be any superlattice of A s . If A 0s is the time-zero lattice of C SI then A is also a superlattice of a 0s . 
Assuming that all latices are binary lattices, the order |A/A 0s | of the partition A/A 0s is 2 b for some integer b. 
Then there are exactly 2 b points from any coset A + a of A in any fundamental region Ro of A 0s , one in each 

75 of the 2 b cosets of A 0s of which A + a is the union. 

Referring to Fig. 4, a trellis constellation with points from A + a that can represent b bits per N 
dimensions can therefore be constructed as follows. Let each b-bit data word 40 specify one of the |a/a 0s | 
= 2 b points 42 from A + a in some simple fundamental region Ro of A 0s . We call this 2 b -point, tri- 
dimensional constellation the initial constellation C lnit . The set (Cum^of all sequences r 44 of constellation 

20 points rj is then capable of uniquely representing each possible sequence of b-bit data words. Then decode 
such a sequence r in (C mii )°° with a fair, exhaustive decoder 46 for the shaping code C s , with common error 
region R(0). Such a decoder will map r into the unique sequence e in R(0) 48 that is congruent to r mod C s . 
Then els the transmitted sequence. The set of all possible e, is called the final constellation C, im so e is an 
element of (C^) 00 . The sequence r can be recovered from e by a simple hard-decision decoder 50 based 

25 on the simple fundamental region R 0 of A 0 , and the original b bits 40 can be recovered once we knqw -r, by 
an inverse map 42. 

Normally decoder 46 is a minimum-distance decoder for C s (or a close approximation thereto). Then the 
common error region R(0) is the Voronoi region R V (C S ) of C s (or a close approximation thereto). As 
discussed above, the final constellation dm then has nominally |A/A S |/|A/A 0s | * | W^s I = 2 k s * imes as 
30 many points as the initial constellation C init in N dimensions. (We say 'nominally' because of possible 
boundary ties.) However, because e is chosen as the sequence of least power in the congruence class that 
contains r, a power savings (measured by the shape gain) is in fact achieved. 



35 Example Based on Ungerboeck 2D Code 

One well-known trellis code is the Ungerboeck 4-state 2D code (Ungerboeck, "Channel Coding 
supra) which we denote as Cu. In coset code notation, Cu is based on the 4-way lattice partition A/A = 
Z2/2Z 2 and on a certain rate-1/2 4-state binary convolutional code C. Cu consists of all integer sequences 
40 that are congruent mod 2 to code sequences in C. Since the code C is linear under mod 2 addition, the 
sum of two sequences in Cu is in Cu, so Cu is a group under sequence addition and thus an infinite- 
dimensional lattice. 

(Conventionally the coordinates of an integer code sequence from Cu are offset by 1/2 for transmission, 
so that the 2D signal points are drawn from the half-integer grid Z 2 + (1/2,1/2); because we require Cu to 

45 be a group under sequence addition, we regard the conventional code as a translate, or coset, of C u .) 

Referring to Fig. 5, to send b = 3 bits per two dimensions, first map each 3-bit word 52 into one of 
eight 2D points r, 54 (the mapping is defined by the unique three-bit label of each point 54 in the Figure) 
which form an initial constellation 56. Constellation 56 consists of the eight elements of the translate 
(Z/2 + 1/4) 2 of the lattice (Z/2) 2 that lie within the simple fundamental region 55 [0,2)x[0,1) of the lattice RZ 2 , 

so which is the time-zero lattice of Cu. The time-zero lattice RZ 2 is the union of two cosets A, D of the lattice 
2Z 2 , where the diamonds 63 indicate points of the two cosets A and D. Referring to Fig. 5A, the partitioning 
(Z/^/RZ 2 is an eight-way partition which can be refined into the chain of two-way partitions (Z/2) 2 /R(Z/2)- 
2/Z2/RZ 2 . Note that the Ungerboeck code sublattice a' = 2Z 2 is also a sublattice of the time-zero lattice A 0 
= RZ 2 . 

55 Referring to Fig. 5B, in the corresponding trellis 70, at each time j, j + 1, j + 2 there are four 

possible states 72 (labeled 1, 2. 3, and 4). Each branch 74 represents a coset of 2Z 2 . Each coset 2Z 2 (A, B, 
C, or D of Fig. 5) appears in two of the branches extending between possible states at a time j and the 
possible states at a time j + 1 . Each encoder state is associated with one of the cosets of RZ 2 , namely 
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either RZ 2 , or the translate RZ 2 + (1,0). (The translate consists of the cosets B and C of 2Z 2 , indicated by 

the squares labeled B and C on Fig. 5.) 

State 1 or state 2 is thus reachable only from prior state 1 or state 3 via branches labeled A or D 

(corresponding to RZ 2 ), and state 3 or state 4 is reachable only from prior state 2 or 4 via branches labeled 
s B or C (corresponding to the translate RZ 2 + (1 ,0)). 

In operation, the map of Fig. 5 first maps each three bits of data 52 to the corresponding point 54 in 

constellation 56. Then the decoder 58, in accordance with the trellis of Fig. 5B, decodes the resulting 

sequence of points to a code sequence c of points in the four subsets A, B, C, D on a maximum likelihood 

sequence estimation basis, using the Viterbi algorithm. Finally, the error sequence between r and c is 
10 derived and the sequence of error values e, (which are points in the final constellation 64) is used for 

transmission. 

Decoder 58 could have any selected delay M before making a final decision of which final constellation 
signal point is to be transmitted. 

Let the resulting sequence r 57 be decoded by any minimum distance decoder 58 for Cu with delay M, 

15 and let the resulting error sequence e 60 be the transmitted sequence. The set of ail possible points ej 62 
produced by this decoder is the final constellation 64. If the decoder for 0 U is a minimum-distance Viterbi 
decoder, then the final constellation is the 16-point constellation shown. To~recover r from e, we may use a 
'hard-decision decoder' to map independently each component ej of e back into one of the 8 points r, in the 
initial constellation as shown in Fig. 4, with little complexity and no delay. 

20 Simulations have shown that the code Cu yields a shape gain of 0.94 db with a delay-15 decoder and 
0.97 dB as M goes to infinity. ~~ 



Shaping Gain Combined with Coding Gain 

25 

The above example illustrates an important property of trellis constellations: the mapping from the 
sequence r to the sequence e only affects the 'most significant bits'. Because c is an integer sequence, the 
'fractional parts' of the components ej of the sequence e must be the same as those of r. Thus if we regard 
the 2 b points in the initial constellation as being specified by (a) b-1 bits that specify the 'fractional parts' of 

30 the two coordinates, and (b) one final bit that specifies whether the point is in the square [0,1)x[0,1) or the 
translated square [1,2)x[0,1) of Fig. 5, it is easy to see that the former b-1 bits are immediately recoverable 
from the transmitted sequence e, with no delay and no error propagation. (As explained below, the recovery 
of the final bit requires a transmitter 'preceding' operation to avoid error propagation.) 

An important consequence of this property is that the 'least significant bits' may be encoded by another 

35 trellis code C c to obtain sequence minimum squared distance d ^ jn (C c ) which is greater than the 
minimum squared distance between points in the initial or final constellations; because these bits are not 
affected by the shaping mapping from r to e of Fig. 5. this sequence distance d £in (C c ) is preserved, 
and a decoder for C c may be used on the noisy received sequence to form the receiver's estimate of e, so 
that the coding gain~7 C (C c ) of the code C c is achieved. Thus the shaping operation is completely compatible 

40 with a separate coding operation, and together they achieve a total coded modulation gain which is the 
combination of the shape gain 7s(Cu) (in this example) with the coding gain 7 C (C C ). 



Ungerboeck Code Example of Combined Coding and Shaping 

45 

^For example, let us use as C c a scaled version of the same Ungerboeck code C U( which has 
d min (Cu) = 4, redundancy r(Cu) = p(Cy) = 1 bit per two dimensions, and thus a coding gain 7 C (Cu) 
= 2* p dms n of a factor of 2, or 3.01 dB. More precisely, let us use the scaled version of this code, say 2' m * 
1 C Ut which is based on the same code C, and on the scaled lattice partition Ac/A c ' = 2' m ' 1 Z 2 /2' m Z 2 1 rather 
so than on Z2/2Z 2 . 

The scaled code sequences c are those sequences that are congruent mod 2" m to code sequences in C 
scaled by 2' m '\ The scaled code 2" m * 1 Cu consists of sequences of points from a translate of ^"^Z 2 , say 
(2" m * 1 Z + 2" m * 2 ) 2 , and has minimum squared distance d ^in (£c) = 2" 2m between scaled code se- 
quences, which remains 4 times better than the minimum squared distance z Zm ' 2 between constellation 
55 points. The coding gain y c (Cu) = 2 is unaffected by scaling. 

Because scaled code sequences are from, e.g., (2" m " 1 Z + 2* m " 2 ) 2 , the initial sequence r in Fig. 5 may in 
fact be a code sequence from the scaled code 2* m * 1 Cu, with points from a 2 2m+3 -point initial constellation 
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Referring to Fig. 6, in a coded modulation system employing the Ungerboeck 4-state 2-D code that 
incorporates both coding and shaping, one bit per two dimensions 80 enters a rate-1/2 encoder E 82 for the 
code C Ul and the two resulting coded bits select one of the 4 cosets of 2' m Z 2 84 whose union is (2* m * 
i z + 2 *m-2)2_ a further 2m uncoded bits 86 select one of the 2 2m cosets of Z 2 88 whose union is the selected 

5 coset of 2' m Z 2 (the combined selection results in one of 2 2m+a cosets of Z 2 ), and a final bit 90 selects 92 
either the square [0,1)x[0,1) or the translated square (1,2)x[0,1), thus determining a point rj in the initial 
constellation 94. A decoder for Cu 58 as before then converts the sequence r of initial constellation points 
into a transmitted sequence e 60. 

Referring to Fig. 7, a receiver for such a coded modulation system operates as follows. First, a decoder 

w 96 for the scaled code 2* m ' 1 Cu estimates the transmitted sequence e (corrupted by noise sequence n 98) by 
maximum likelihood sequence estimation. If this estimated sequence e' is correct, then there is an inverse 
map 100 to the initial sequence r, and from this the original bit sequences 102 can be determined 104 as in 
Fig. 4. (Below we discuss how to prevent error propagation). 

In summary, the coded modulation system of Fig. 6 takes in 2m +2 bits per two dimensions and 

75 produces a sequence e from the final constellation, which is the set of 2 2m+4 points from (2" m ' 1 Z + 2' m ^) 2 
which lie in the square region [-1,l]x[-1,1]. The total constellation expansion ratio, compared to a 2 2m + 2 - 
point uncoded square constellation, is thus a factor of 4, of which a factor of 2 is due to coding, and a factor 
of 2 to shaping. A total gain of 3.95 dB may be achieved, of which 3.01 dB is due to coding and 0.94 dB to 
shaping. 

20 (While shape gains are ultimately limited to 1.53 dB, which is much smaller than typical coding gains, 
we have found that after the initial 3-4 dB of coding gain it takes of the order of a doubling of coding 
complexity to achieve each 0.4 dB increase in effective coding gain, so that at some point it is easier to 
achieve further gains by more complex shaping rather than by more complex coding. In this example, for 
instance, the total gain is about the same as that obtainable with a 16-state Ungerboeck code used for 

25 coding, with no shaping.) 



Combining Shaping with Coding, In General 

30 In general, let C C (A C /A C '; C c ) be a trellis code to be used for coding, and let C S (A S /A S ;C S ) be.another 
(linear) trellis code to be used as the basis for a trellis constellation to be used with C c . If A c , A c . A s ,^and A s 
are all N-dimensional binary lattices, then because 2 n Z N is a sublattice of A c ' for some n, if we scale the 
code C c by 2"™ for any m£0 (i.e., replace C c by 2- m - n C c , the code based on the scaled lattice partition 2" m ' 
"Ac/^Ac' and the same convolutional code C c ), then 2' m ' n A c '/2* m Z N /Z N /A s is a lattice partition chain. 

35 (Equivalently, we could scale C s by 2 m + n .) 

The code sequences of 2* m " n C c are the sequences in a coset of (2- m * n A c ) that lie in one of the 
sequences of cosets of (2"™ Ac') 00 that could be selected by a code sequence from C c . They can therefore 
be defined as those sequences r from a coset of (2' m - n A c )°° that satisfy a certain set of constraints on 
congruence mod (2™^)°°. Because (A s )°° is a sublattice of (2* m ' n A c )°° for m*0, any sequence e that is 

40 congruent to r mod C s (and thus, a fortiori , mod (A s )°°) will remain in the same congruence class as r mod 
(2' m ' n \cf°, and will thus remain a'legitimate code sequence in 2' m ' n C c . Thus the transmitted sequences e 
produced by a decoder for C s operating on code sequences r from 2* m - n C c remain code sequences in 2' m ' 
n C c . This means that the minimum squared distance between the possible transmitted sequences e is still 
d"& in (2"™C C ), and that the coding gain 7c <2* m ' n C c ) = 7c (C c ) is still achieved. It also means that a 

45 decoder for 2 ,m * n C c can be used at the receiver to estimate e. 

Referring to Fig. 8, in general, the code C c is a general coset code based on a partition VA C of N- 
dimensional binary lattices, with N even, and a rate-kc/(k c + r c ) encoder E c 112. If the depth of A c is u c = 
then the informality k(A c ') satisfies 2 k < A c' > = |A C '/R"Z N |, and the informativity k(C c ) of the code C c is k- 
(C c ) = k c + k(A c '). If the code C c is replaced by the scaled code R m ^C c for nrteO, then k(C c ) data bits 114 

50 per N dimensions select a coseTl16 of R' m Z N in some coset of FT^Ac 110. A further mN/2 uncoded bits 
per N dimensions 118 select a coset of Z N in the selected coset of R* m Z N . Finally, if the redundancy of the 
■ shaping code is r(C s ) = r s + r(A s ), so that 2'<&S) = |Z N /A S ||A S /A 0s |. then a final r(C s ) bits 122 per N 
dimensions select Tcoset of the time-zero lattice A 0s in the selected coset of Z N , which determines a point 
r, 124 in a predetermined fundamental region R(A 0s ) of the time-zero lattice. Thus the initial constellation 

55 Cmit consists of the IR-^Ae/AosI = 2 k <£c c— £ s > points of R*™A c + a that lie within R(A 0s ). A 
sequence r 126 of such points is mapped into the sequence e 128 congruent to r mod C s that lies within 
the Voronoi region of C s . The final constellation C fin is then the set of nominally |Ac/A s | = 2 k " 
<C C )+r c +mN/2«r( Q S ) + k points of A c + a that lie within the Voronoi region R V (A S ) of A s . The consteila- 
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tion expansion ratio in N dimensions is thus 2 r c* k s, because the system transmits k(C c ) + mN/2 + r(C s ) 
bits per N dimensions. 

The constituent 2D final constellation C 2 ,u n is a set of points from a coset of A 2c that lie within the 
Voronoi region R v (A 2s ) of A 2c , where A 2c is the constituent 2D lattice of A c , and A 2s ' is the constituent 2D 
5 sublattice of A s *7Thus the size |C 2 ,, fn | of C 2(fJn is nominally |A 2c /A 2s '| = IR^cZ^sZ 2 ) = 2 (m+u c +lA s ,w/2 , 
where u s is the depth of A s '. Since k(C c ) + r(C c ) = u c N/2 and k(Ce) + r(C s ) = u s N/2, the constellation 
expansion ratio in two dimensions is 2 2rr £ c vn.^ C s yw = 2 ^9 C >'2 X < <T S ) = CER C (C C )*CER S (C S ). 

w Algebraic Theory, Duality, and Error Propagation 

In this section we develop elements of an algebraic theory of lattice-type trellis codes which applies to 
both coding and shaping operations. In both cases, we are interested in preventing unlimited error 
propagation at the receiver, which means that all elements of the receiver involved in the inverse map from 
rs the estimated e to the corresponding bit sequences (e.g., blocks 100, 104 of Fig. 7) should be feedbackfree. 
The question of when feedbackfree inverses exist is essentially algebraic. 

The algebraic theory also naturally introduces notions of duality; the elements of a shaping code are 
inherently dual to those of a coding code. 

20 

Algebraic Theory of Convolutional Codes 



Trellis Constellations 

25 

A rate-k(k + r) binary convolutional code C is generated by a k-input, (k + r)-output finite-state sequential 
circuit, called an encoder E, that is linear over the binary field F = GF(2). The inputs to such a circuit are a 
sequence x of binary k-tuples, where x is an element of (F k )°° and the outputs are a sequence y of binary 

30 (k + r)-tuples, where y is an element of (F k + r )°° t called a code sequence. The code C is the set of all 
possible such code sequences. 

Because the encoder is linear over F = GF(2), it may be characterized by a k-by-(k + r) generator 
matrix G, consisting of k generators g if 1^k. The generator gj is the output code sequence of (k + r)-tuples 
if the inputs are all-zero except for a single 1 on the i lh input at time zero i.e., gs is the response to a unit 

35 sequence 1 = (1,0,0 ...) on the i lh input. Because of linearity and time-invariance, the code sequence y 
corresponding to any input sequence x is then y = £i^k Xjgj = xG. (Here sequence multiplication is 
defined in the standard way as convolution; under this definition, F°° is a ring, and indeed a principal ideal 
domain.) 

Because the encoder is finite-state, the elements of G must be sequences that can be expressed as a 
40 ratio of two polynomials a(D)/b(D), in D-transform notation — i.e., that are elements of the ring of rational 
functions F(D), where F = GF(2). If all elements of G are actually polynomials — i.e., are elements of the 
ring of polynomials F[D] - then G is said to be polynomial, or feedbackfree. 

There are two canonical forms for encoders for a given convolutional code C. A minimal encoder is one 
with a generator matrix G for which 
45 (a) all elements of G are polynomials, 

(b) the k-by-k minors of G (determinants of all k-by-k submatrices) have no common factor; 

(c) the high-order coefficient matrix of G is nonsingular over GF(2). Because a minimal encoder has a 
polynomial G, it has an obvious feedbackfree realization involving k shift registers of lengths v u 1^iSk, where 
v x is the degree of the generator g it namely the maximum degree of the k + r polynomial elements of g ( The 

so overall constraint length is v = Zv it which is also the maximum degree of any k-by-k minor of G, and the 
encoder has 2 V states. It can be shown that 2 V is the least number of states in any encoder that generates 
the code C. 

A second canonical form is a systematic encoder, which is an encoder with a generator matrix G which 
has a k-by-k submatrix which is an identity matrix l k . By permutation of the order of the outputs, such a G 
55 can be written as G = [l k P] B where the k-by-r parity-check matrix P in general consists of rational functions 
and is not feedbackfree. 

For example, a general rate-1/2 code C has a minimal encoder E defined by the generator matrix G - 
[gi(D), g 2 (D)], where gi(D) and g 2 (D) are two polynomials with no common factor. It can be realized by a 
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single shift register of length r, where v = max [cleg gi(D), deg g 2 (D)]. The corresponding systematic 
encoder is G = [1,g2(D)/gi(D)], which can also be realized by a circuit with v memory elements with 
feedback* 

If x and y are sequences of binary n-tuples -i.e., elements of (F n )°° « their inner product (x, y) is 
5 defined as Ei<sjSn x,y,, which is a binary sequence i.e., an element of F°°. Two such sequences are 
orthogonal if their inner product is the zero sequence. The dual code C A to a rate-k/(k + r) convolutional 
code C is then the set of all sequences of binary (k + r)-tuples that are orthogonal to all code sequences in 
C. 

A syndrome-former H for a rate-k/(k + r) code C is a set of r sequences of (k + r)-tupels hi., 1£i $r, such 
io that the inner product (y, h) of any code sequence y in C with any of the syndrome-former sequences hi . 
is zero. This occurs if and only if GH T = 0, where the (k + r)-by-r matrix H T is the transpose of H » i.e., if 
and only if the k-by-r matrix of inner products (g h hi',) is all-zero. The r-by-(k + r) matrix H is the generator 
matrix for a rate-f/(k + r) convolutional code which is the dual code C A to C. (As an encoder, H has r inputs 
and k + r outputs; as a syndrome-former, H T has k + r inputs and r outputs.) 
15 If G * [l k P] is the generator matrix of a systematic encoder for C. then H = [P T l r ] is the generator 
matrix of a systematic encoder for C A . H can also be converted to minimal (feedbackfree) form. Its r-by-r 
minors are identical to the k-by-k minors of G, and H can be realized with the same number , of memory 
elements as G. (If G is the generator matrix for a rate-k/(k + 1) code C. then H is the generator matrix for a 
code C A whose rate is 1/(k + l), and its single generator h completely determines C). 
20 For example, for a rate- 1/2 code C, if G - [gi(D) f g 2 (D)], then H = [92(D). g,(D)] (minimal form), or H 

= [92(D)/gi(D), 1] (systematic form) [or H = [1. gi(D)/g 2 (D)] (alternative systematic form)]. It is. readily 
verified that for any of these forms GH T = 0. Note (from the minimal form) that the dual C of a rate-1/2 
code C is the same code, except with the two outputs interchanged, so that rate-1/2 codes are effectively 
self-dual. 

25 A minimal syndrome-former H can be augmented with combinational circuitry (so that there is no 
increase in the number of states) to become a (k + r)-input. (k + r)-output circuit, with k additional generators 
(g~ 1 )I . r+ iSiSk + r, that form a k-by-(k + r) polynomial matrix (G" 1 ) 1 " such that GG" 1 = l k ; i.e., the (k + r)-by-k 
matrix G" 1 is a right inverse for G. Furthermore, the (k + r)-by-(k + r) polynomial matrix A = [G~[H ] has 
determinant 1, so that it is invertible, with polynomial inverse A" 1 = [G T H —1 ] T . (Such an invertible 

30 polynomial matrix is called a (k + r)-by-(k + r) scrambler.) 



Ungerboeck Code Example 



35 



40 



45 



For example, the Ungerboeck 4-state 2D code Cu is the mod-2 trellis code that is generated by the 
rate-1/2 4-state convolutional code C whose generator matrix is G - [1+D 2 , 1+D + D 2 ]. The minimal 
syndrome former for C is H T = [1 +D + DM +D 2 ] T , and a right inverse of G is G" 1 = [D 2 , 1 +D + D 2 ] . The 
matrix 



A = 



1+D+D 2 



1+D+D' 
1 + D 2 



has determinant 1 and inverse 

= G 

55 

Referring to Fig. 9, encoder G 130 and the combined syndrome-former/right inverse 132 A = [G" 1 H T ] 
4-state feedbackfree circuits. 



50 



-1 



1+D' 



1+D+D' 



1+D+D' 
~2 
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10 



Let z be any sequence of (k + r)-tuples. Then zA = [zG~\ zH T ] = [x(z), s(z)] is a sequence of (k + r)- 
tuples composed of input k-tuples x(z) and syndrome r-tuptes s(z), In turn, z can be recovered from [x(z), s- 
(z)] by 

z = [x(z), s(z)]A- 1 = x(z)G + s(z)(H- l ) T = V(z) + t(z), 

where y(z) = x(z)G = zG-'G is a code sequence in C, and t(z) = s(z)(H" 1 ) T = zH T (H* 1 ) T = z(H~ 1 H) T is a 
coset representative of the coset of the code C that is identified by the syndrome s(z). The (k + r)-by-(k + r) 
matrices G~ 1 G and (H -1 H) T are orthogonal projection operators that project sequences in (F k + r ) ot> , regarded 
as a linear space (free F°°-module) of dimension k + r over the ring F 00 , onto orthogonal linear subspaces of 
dimensions k and r, respectively. This is a coset decomposition (F k+r )°° = C + [(F k+r ) co /C] in which the set 
of [(F k+r )°°/C] of coset representatives t(z) is itself a linear subspace of (F k + r )°°. 
For example, for the 4-state rate-1/2 Ungerboeck example code, 



75 



20 



25 



G = [1+D 2 , 1+D+D 2 ]; H = [ 1+D+D 2 , 1+D 2 j 
H-l = [1+D+D 2 ,D^]T ; g -1 = [D 2 # 1+D+D 2]T ; 



G" 1 G = 



2 4 
D -hD 



1+D+D 3 +D 4 ^ 



2 3 An 
D Z -hD J +D* 1 



(H X H) T = r 



1+D^+D n 
1+D+D 3 +D 4 



^ 1+D 2 +D 4 ; J 



r-2 3 
r D +D +D 



L D 2 + D 4 



Note that both G _1 G and (H -1 H) T have rank 1, and that G^G + (H _1 H) T = l 2 . 

Because a convolutional code C is a group under sequence addition over F, we may define two binary 
(k + r)-tuple sequences as congruent mod C if their difference is an element of the convolutional code C, 

30 and the cosets of a convolutional code C as the sets of congruence classes mod C. It is easy to see that a 
coset of C is the set of binary (k + r)-tuple sequences z such that zH T = s, where s is given r-tuple 
sequence, called the syndrome sequence. If t = s(H~ 1 ) T , then from the above, the (k + r)-tuple sequence t is 
a coset representative sequence for the coset C + t that is associated with the syndrome sequence s. The r- 
by-(k + r) matrix (H~ 1 ) T may therefore be used to specify a coset representative generator circuit, which, 

35 given a syndrome sequence s, generates a corresponding coset representative sequence t. 

For the example code, (H" 1 ) T = [1 + D + D 2 ,D 2 ] is such a coset representative generator circuit. If we do 
not care about feedback, then we may also consider the circuits [1/(1 + D + D 2 ), 0] or [0, 1/(1 +D 2 )] as coset 
representative generator circuits (H _1 ) T ( since the transpose of either of these matrices is a right inverse for 
H - [1 + D + D 2 , 1 + D 2 )]; then the elements tj of the coset representative sequence t are always in the set 

40 {[0, 0] t [1, 0]} or the set {[0, 0], [0, 1]}, respectively. Thus by 'preceding 1 the syndrome sequence s by 
multipiication by 1/(1 + D + D 2 ) or 1/(1 +D 2 ), we obtain a sequence t of coset representatives that is always 
one of a set of coset representatives for the time-zero block code Co, which in this case is the (2,1) 
repetition code. 

In general, by use of a linear transformation involving feedback, the elements tj can always be confined 
45 to a set of 2 r coset representatives for the time-zero code Co in the set of all binary (k + r)-tupies. (Proof: at 
feast one of the r-by-r submatrices of H is invertible, since the greatest common divisor of the r-by-r minors 
of a minimal encoder H is 1; the inverse of this matrix can be used as the precoder,) 



50 



Algebraic Theory of Mod-2 Trellis Codes 



Since mod-2 trellis codes are isomorphic to convolutional codes, their algebraic theory is essentially the 
same as that of convolutional codes. 
55 Specifically, we have noted that a mod-2 trellis code C can always be regarded as being based on the 
lattice partition A/a' = Z N /22 N , and may be defined as the set of all integer N-tuple sequences c that are 
congruent mod 2 to code sequences y in some rate-k/N convolutional code C, so we may denote C as C- 
(Z N /22 N ;C). ~ " 
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The theory of the previous section then follows unchanged. A minor modification must be made in the 
definitions of inner product, orthogonality, and duality so as to apply to integer sequences rather than binary 
sequences, but as we shall now show, the required change does not affect the isomorphism. 

If x and y sequences of real N-tuples - i.e., elements of (R N )°° - their inner product (x, y) is defined as 
the real sequence dsiSNXiyi, an element of R°°. If x and y are sequences of integer N-tuples, then their 
inner product (x, y) is an integer sequence. Two sequences are strictly orthogonal only if their inner product 
is the zero sequence. However, they are said to be orthogonal mod 2 if their inner product is any even 
integer sequence i.e., an element of (2Z)°°. 

The dual trellis code C A to a mod-2 trellis code C may then be defined as the set of all sequences of 
real N-tuples that are orthogonal mod 2 to all code sequences in C. As a trivial example, if C is the set (Z N )- 
00 of all sequences of integer N-tuples, than C A is (22 w )°°, and vice versa. Since every mod-2 trellis code C 
includes the set (2Z N )°°, its dual C A must be a subset of (Z N )°°; since (2Z N )°° is orthogonal mod 2 to any 
integer sequence, the dual C A must include include (2Z N )°°. 

It is easy to see that if C is generated by the rate-k/N code C, then C A is the mod-2 trellis code that is 
generated by the dual rate-r7N code C A to C, where r = N-k. First, as we have just seen, C must be a set 
of integer N-tuple sequences. Second, if c is a sequence in C that is congruent mod 2 to a sequence y in 
C, let d then be any sequence of integer N-tupies, and let z be the sequence of binary N-tuples to which d 
is congruent mod 2. Then the inner product (c,d) is congruent mod 2 to the inner product (y,z), which isjthe 
all-zero sequence if and only if z is a sequence in the dual code C A . Thus (c,d) is an element of (2Z) if 
and only if d is in the mod-2 trellis code generated by C A namely C , QED. 

A mod-2 trellis code C is a group under real sequence addition, and its cosets are the sets* of 
congruence classes mod C"lf r is any sequence of real N-tuples with elements rj which are elements of R N . 
then there is a unique sequence z of binary N-tuples with elements zj in {0, 1 } and a unique sequence f of 
fractional N-tuples with elements f, in [0, 1) such that r is congruent to z + f mod 2. A coset of C may then 
be specified by a binary r-tuple syndrome sequence s and a fractional sequence f, and consists of all r 
congruent to z + f mod 2 such that zH T = s, where H T is a syndrome-former for the code C that generates 
C. For if r is any such sequence and c is any sequence in C that is congruent mod 2 to a code sequence y 
in C, then r + c is congruent to (z + y) + f mod 2, where the sequence z and y may be added modulo 2, and 
since z + y is a binary N-tuple sequence in the same coset of C as z, it has the same syndrome sequence 
(z + y)H T = zH T = s. 

Generation of Trellis Constellations Based on Mod-2 Trellis Codes 



It follows that a trellis constellation based on a mod-2 trellis code C(Z N /2Z N ;C) may be generated as 

f0ll °First. let A be a binary lattice which is a superlattice of Z N « e.g.. 2' m Z N , R* m Z N , for m * 0. If [A/Z N ] is 
the order of the partition |A/Z N |. then there are [A/Z N ] elements of any coset A + a of A in any fundamental 
region of Z N , and in particular in the half-open N-cube [0,1 ) N . A fractional sequence f may then be specified 
by a sequence of binary (log 2 [A/Z N j)-tuples -- i.e., by log 2 [A/Z N ] bits per N dimensions. 

Second, a syndrome sequence s may be specified as a sequence of binary r-tuples, where r = N-k is 
the redundancy of the rate-k/N convolutionai code C. The syndrome sequence s may be converted to a 
coset representative sequence t = s(hr') T by an r-input. N-output coset representative generator circuit 
(H -i)T where the N - b y- r matrix H" 1 is a right inverse to an r-by-N minimal generator matrix H of the dual 
rate-r/N code C A to C. (If desired, all but r of these outputs may be constrained to zero.) 

Referring to Fig. 10, the sequence r = t + f 140, with t now regarded as a sequence of integer N-tuples, 
is then a coset representative for a coset of the trellis code C. A minimum-distance decoder 142 for C can 
then find the sequence e of least average power that is congruent to r mod C, as we have seen before. 

If we are not concerned about feedback (and there is no reason that we should be in the transmitter), 
then (H-y 144 can be chosen so that the output N-tuples tj 146 of the coset representative generator 
circuit are coset representatives for the 2 r cosets of the time-zero code Co in the set (N, N) of all binary N- 
tuples, and thus, as integer N-tuples, also coset representatives for the 2 r cosets of the time-zero lattice A 0 
in Z N , rather than as cosets of a' (as shown in Fig. 10). 

Referring to Fig. 11, let e + n be the received sequence 150, where n is a noise sequence introduced by 
the communications channel. The receiver first generates an estimated sequence e 152, which in the 
absence of coding may simply be the result of a symbol-by-symbol decision as to which element of A + a is 
closest to each received N-tuple ej + nj. Given adequate signal-to-noise margin, most of these symbol 
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decisions will be correct, but we must assume that there will occasionally be an error. Therefore the 
receiver must be able to recover the original bit sequences in such a way that (a) if e = e, then the correct 
bit sequences are recovered: (b) if e j is not equal to ej only for a single symbol, or more generally only for 
a short time, then the correct bit sequences are recovered except for a short time starting at the time of first 
5 error. In other words, error propagation must be limited. 

As needed above, the sequence of elements of e mod Z N is the same as that of r, namely the fractional 
sequence f. Therefore we may recover the elements of f by the symbol-by-symbol operation f, = ej mod 
Z N 154; i.e., by taking the fractional parts of the estimated sequence e\ In turn, each f/ determines a 
corresponding binary (log2 [A/Z N ])-tuple. There can be errors in f/ only where there are errors in ej', so 
70 error propagation is totally absent. 

The sequence t of integer parts t/ = ej' -f/ of the received sequence e' may be passed through a 
syndrome-former H T to form an estimated syndrome sequence s = t H T , whose elements Sj are binary r- 
tuples that are estimates of the corresponding input r-tuples Sj. If H is chosen to be feedbackfree, as is 
always possible, then a brief run of errors in e{ and thus possibly t/ can lead only to a somewhat longer run 
is of errors in S] . In fact, errors in Sj can occur only for 5 = max deg hjj(D) time units after the last error in ej , 
where 5 is the maximum degree of any polynomial in the r-by-N matrix H. Thus some error propagation 
may occur, but it is strictly limited. 

20 General Construction for Lattice-Type Trellis Codes with Trellis Constellations 

We are now able to show a general construction that combines a lattice-type trellis code C c used for 
coding with a mod-2 trellis code C s used for shaping. The construction is completely symmetrical, and 

25 nicely illustrates the duality between coding and shaping. 

Referring to Fig. 12, assume that the code C c is based on a partition Ac/A c of binary N-dimensional 
lattices, where N is even, the order of the partition is [Ac/A c '] = 2 k c + r c, and the depth of the lattice A c ' is u c 
(called simply u in this section). The code C c is scaled by R* m * u , where m£0, so that R- m Z N is a sublattice of 
R* m A c ' of order 2 k(A c ' >, where k(A c ') is the informativity of A c \ The (k c + r c )-bit labels for the cosets of A c ' 

30 in A c are generated by a rate-kc/(k c + r c ) encoder E c for the code C c with generator matrix G c 150. Since A c 
and A c ' are binary lattices, we may if we like take A c as R- m - u z N and A c ' as R* m Z N , in which case the 
encoder has k c + k( A c ') = k(C c ) inputs 152 and k c + r c + k(A c ') + r(A c ) = u.N/2 outputs. 

The code C s is based on a partition A s /A s ' of mod-2 N-dimensional lattices, where the order of the 
partition is [A s /A s '] = 2 k s + r s, the depth of the lattice A s ' is u s = 2, and the order of the partition [Z N /A S ] is 

35 2 r(A s\ where r(A s ) is the redundancy of A s . The (k s + r s )-bit labels for the cosets of A s ' in A s are generated by 
a rate-r s /(k s + r s ) coset representative label generator with generator matrix (H~ 1 )T T 160. Since A s and A s are 
mod-2 lattices, we may if we like take A s as Z and A s as 2Z N , in which case the coset representative label 
generator has r s + r(A s ) = r(C s ) inputs 162 and k s + r s + k(A s ') + r(A s ) = N outputs 164. 

The central (scaling) section 166 merely uses mN/2 uncoded bits 168 to identify a coset of Z N in R- 

40 m Z N , where m may be any non-negative integer. 

Thus a total of k(C c ) + mN/2 + r(C s ) bits enter the system per N dimensions and are converted into a 
sequence of multi-part labels that specify a sequence of cosets of 2Z N in R" m ' u Z N , or equivalently a coset 
representative sequence r 170, where each rj is an element of a set of 2 m + u + 2 coset representatives for the 
cosets of 2Z N in R- m -"z N . A decoder for C s 172 then finds the sequence e 174 of least average power in the 

45 set of sequences that are congruent to r mod C s . 

At the receiver, a decoder for C c 176 operates on e + n, where n is an additive white Gaussian noise 
sequence, to find the closest code sequence e in C c . The components of e t of e' then determine the coset 
representative sequence r . The components rj ofY may be mapped back into corresponding multi-part 
labels, which comprise estimates of the various bit sequences at the transmitter. The sequence t' 178 of 

so binary N-tuples representing cosets of 2Z N in Z N is passed through a feedbackfree syndrome-former Hg 180 
to form a sequence s' = tH\ of estimated syndrome r(C s )-tuples 181. The sequence of (mN/2)-tuples 
representing cosets of Z N in R* m z N is taken directly as an estimate of the corresponding input (mN/2)-tuples 
182. And the sequence of binary (uN/2)-tuples 184 representing cosets of R' m z N in R- m *"Z N is passed 
through a feedbackfree right inverse G^ 1 186 to form a sequence of estimated information k(C c )-tuples. 

55 Because all maps at the receiver are feedbackfree, error propagation is limited. ~" 

This picture shows that coding with C c and shaping with C s are dual in many ways. The encoder G c in 
the transmitter has k(C c ) inputs; the encoder Hi for the dual code is used in the receiver as a syndrome- 
former with r(C s ) outputs. (The informativity k(C) and redundancy r(C) of a code C are dual quantities.) The 
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corresponding right inverses G" 1 and (H; 1 ) t are similarly in dual positions, with (H^' ) t used as a r(C s )-input 
coset representative sequence generator. Correspondingly the constellation expansion ratios in two dimen- 
sions due to coding and shaping are CER c (C e ) = 2"< £ c > and CER S (C S ) = 2*< £ s respectively. The 
main complexity of the coding and shaping operations lies in the decoder for C c and the decoder for C s . 
respectively, which operate respectively on" the output of the transmitter and the input to the receiver. Apart 
from the fact that we have restricted the shaping code C s to be linear, therefore, we have a completely 
symmetrical and dual picture. 



Wei Code Example of Combined Coding and Shaping in Four Dimensions 



The Wei 4D codes (Wei, "Trellis-Coded Modulation with Multidimensional Constellations," IEEE Trans. 
Inform. Theory. Vol. IT-33. pp. 483-501. 1987) have good practical characteristics. In particular, there are 8- 

75 state and 16-state codes based on the partition Z*/RD* and rate-2/3 convolutional encoders that achieve 
d 2 . _ 4 witn on)y one re dundant bit per four dimensions, so that the normalized redundancy p is 
1/2™ and a nominal (fundamental) coding gain of y c = 2""d * in = 2 3 ' 2 (4.52 dB) is obtained with a 
coding constellation expansion ratio of only CER C = 2" = 2 ,/a = 1.414. The 16-state code has a 
normalized error coefficient of only No = 12, whereas the 8 state code has No = 44. These codes are 

20 mod-2 (and depth-2) codes and may equivalents be regarded as being based on the partition. Z /2Z with 
augmented rate-3/4 convolutional codes C. 

The duals to the Wei codes are good candidates for shaping codes. The dual Wei 4D codes may be 
regarded as being based on the partition ZV2Z* with rate-1/4 convolutional codes C^ with the same 
number of states as their dual rate-3/4 codes C. Their normalized informativity x is 1/2, and thus their 

25 shaping constellation expansion ratio is CER S = 2" = 2 1S = 1.414. 

Referring to Fig. 13, Wei code C 0 for coding is scaled by R- m2 so that it is based on the partition R" " 
2 Z*/R- m Z* 190. The rate-3/4 binary encoder G c for C 192 is used in the transmitter for encoding, and a 
feedbackfree 4x3 right inverse 194 is used in the receiver to recover the input information. 3-tuples 196. 
The 4x3 syndrome-former H[ for the dual code C A 198 is the transpose of a minima! encoder G c for .ts dual 

30 C and the 3x4 coset representative generator (H s ' ) T 200 is the transpose of a corresponding, right inverse 
Gc' (though it need not be feedbackfree). The total number of bits 202 transmitted is 2m + 6 per four 
dimensions, or m + 3 bits per two dimensions, so that any integer number of bits per two dimensions (not 
less than three) may be transmitted. The final constellation (in two dimensions) consists of the nominally 
2m + 4 points of a translate of R^Z? that lie in the Voronoi region of 2Z 2 , and the total constellation 

35 expansion ratio is thus only 2, of which 2 ,s is due to coding and 2 ,/2 to shaping. (This compares to a ratio 
of 4 in the 2D Ungerboeck example.) 

Minimal rate-3/4 generator matrices G c and right inverses Gc' for the 8-state and 16-state Wei codes C 
are shown in Figs. 14 and 15. The transposes of these matrices are syndrome-former matrices H s and 
coset representative generator matrices (H s ' ) T for the dual Wei codes C A For reference, the minimal rate- 

40 1/4 generator matrix G s = H c of the dual code C A and a feedbackfree right inverse G s = H c are also 
given for each case. 

For both of these examples, the code C generated by G c has Hamming distance d H - 4. The final 
generator ([1111]) has no memory and serves simply to select one of the two cosets of 2Z in RD* (by 
selecting one of the two codewords of the (4,1) repetition code). The 8-state code has 5 we.ght-4 words. 

45 one of length 1 ([1111]) and four of length 2 ([1100.1010] and their complements); the 16-state code has 
one weight-4 word ([1111]). For both, the time-zero block code is the (4.3) single-parity-check code n both 
cases, the code C A generated by H c has the (4.1) code as its time-zero block code, and all 4-tuples are in 
the (4 3) single-parity-check code (so all selected integer 4-tuples are in D*). 

By simulation, we have determined that the 8-state dual Wei code chieves about 0.90 db shape gam for 

so M = 7 (a decoding window of 16 QAM symbols), and 1 .05 db as M approaches infinity, better asymptotically 
than the 4-state Ungerboeck code, and with much reduced constellation expansion. 



Constellation Boundary Constraints 

In most practical applications, it is desirable to keep the final constellation size small and the 
constellation boundary circular in order to reduce the peak-to-average ratio (PAR). This will improve the 
performance against impairments such as nonlinear distortion or phase jitter. 
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In trellis shaping using trellis codes that are based on partitions of binary lattices, the elements ej of the 
transmitted sequence belong to a 2D constellation that lies within a square boundary. Trellis shaping 
expands the size of the 2D constellation by a factor of 2 k( - s> . It is possible to reduce the constellation 
expansion, and obtain a more circular boundary by applying constraints to the decoder 46 (Fig. 4). For 

s example, the decoder can be constrained in such a way that it only selects code sequences c (from the 
shaping code C s ) which result in transmitted sequences e = r-c whose elements ej = rpCj have magnitudes no 
greater than some predetermined radius R c ; i.e., |ej| = |r r Cj|^R c . When R c is reasonably large (e.g., R c is 
large enough that for every state at least one branch is always allowed), the decoder can proceed without 
violating the constraint. However, if R c is too small, it may be forced to select a code sequence that does 

io not satisfy this constraint. 

This type of constraint can be incorporated into a Viterbi-type decoder very easily by deemphasizing 
those branches of the trellis whose minimum mean-squared error (MSE) metrics exceed R§ , by multiplying 
them with an appropriately chosen number Q. The larger Q gets, the harder the constraint becomes. This 
reduces the probability of choosing points lying outside the constraint circle. When Q is sufficiently large, 

75 and R c is not very small, points outside the constraint circle will never be selected. Note that if R c is too 
small, one should not disallow branches, because this may force the decoder to choose an illegal sequence 
as a code sequence, and then the initial sequence r cannot be correctly recovered in the receiver. 

When the transmitted sequence e is filtered before transmission over the channel, it may be beneficial 
to apply the constraint in higher dimensions (on groups of successive elements of the sequence e), in order 

20 to minimize the PAR after the filtering. 

For example, the constraint may be applied four dimensionally. This can be incorporated into the 
decoder (for decoding the 8-state dual Wei code) as follows: First, compute 4D branch metrics as usual 
based on mean-squared error; then compare these against the threshold R§ . Branch metrics which exceed 
the threshold are multiplied by some factor Q (typically a power of 2) to deemphasize the associated 

25 branches. When Q is sufficiently large, the decoder will avoid selecting these branches, unless it is forced 
to do so (that is, all paths have been deemphasized). Again, if R§ is large enough that for every state at 
least one branch is allowed, for large enough Q, the decoder will never pick two successive symbols e 2j and 
e 2 j + i with total energy greater than R 2 ; i.e., |e 2i | 2 + |e 2 j +1 | 2 ^R§ . 



Modem Implementation of 8-state 4D Dual Wei Code 

We now show how to use a four-dimensional 16-state Wei code for coding (denoted as C c ) and a four- 
dimensional 8-state dual Wei code for shaping (denoted as C s ) and send 7 bits per two dimensions. The 

35 code C s is based on a rate-1/4 convolutional encoder and the partition ZV2Z 4 . It has the time-zero lattice A 0 
= RD+. The code C c is based on a rate-3/4 convolutional encoder and it is scaled such that it uses the 
partition 2~ 3 ZV2~ 2 Z 4 \ It adds one redundant bit every four dimensions. Note that a fundamental region of 
the time-zero lattice RD* of Cs will contain exactly 2 2x7 + 1 points from any coset of the scaled lattice 2~ 3 Z 4 ; 
i.e., |2- 3 ZVRD*| = 2 1S . ~ 

40 Referring to Fig. 17, in a modem transmitter 210, in each (2D) signaling interval a scrambler 236 
receives 7 bits from a data terminal equipment (DTE) 238. The scrambled bits are delivered to a binary 
encoder 240 in groups of fourteen so-called l-bits 242, labeled I6(2j) through I0(2j) and I6(2j + 1) through 10- 
(2j + 1 ), at two successive times 2j and 2j + 1 . 

Referring to Fig. 18, and l-bits I2(2j)l1(2j) are taken to represent an integer mod 4 and differentially 

45 encoded by a coding differential encoder 244 to produce two differentially encoded bits ID2(2j)ID1(2j) 
according to the relation 

ID2(2j)ID1(2j) = l2(2j) = l2(2j)l1(2j) e 4 ID2(2j-2)ID1(2j-2) 
where ©4 represents mod-4 addition. 

The bits ID1 (2j) and I0(2j) enter a rate-2/3 binary convolutional encoder 246, which . produces a 
so redundant bit Y0(2j). Referring to Fig. 19, encoder 246 includes a sequential circuit having four delay-2 shift 
registers 248 and three mod-2 adders 250. This encoder can also be realized using other circuits. 

Referring again to Fig. 18, the bits ID2(2j)ID1(2j)IO(2j) and the redundant bit Y0(2j) enter a bit converter 
252 that generates four output bits Z1 (2j)Z0(2j)Z1 (2j + 1)Z0(2j + 1) according to the following table: 
ID2(2j) ID1(2j) I0(2j) Y0(2j) Z1(2j + 1) Z0(2j + 1) Z1(2j) Z0(2j) 



30 
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20 These output bits will be used to select one of 16 cosets of 2~ 2 Z* whose union form the four-dimensional 
grid 2" 3 Z* + (2-\2"\2" 4 ,2-*). Each such coset can be represented by a pair of cosets of 2~ 2 Z 2 in the two- 
dimensional grid'2- 3 Z 2 + (2-*,2-*). These will be determined independently by Z1(2j)Z0(2j) and Z1(2j + 1)- 
Z0(2j + 1). Note that the differential encoder 244, encoder circuitry 246, and the bit converter 252 together 
form the rate 3/4 binary encoder G c 192 of Figure 13. 

25 The remaining l-bits I3(2j) through I6(2j) pass through unchanged, are relabeled as Z2(2j) through Z5(2j) 
and are combined with Z1(2j)Z0(2j) to form the six-tuple Z5(2j)...Z1(2j)Z0<2j). Similarly, the I bits I3(2j + 1) 
through I6(2j + 1 ) together with Z1 (2j + 1 )Z0(2j + 1 ) form the six-tuple Z5(2j + 1 )...Z1 (2j + 1 )Z0(2j + 1 ). 

Referring to Fig. 20, the l-bits I2(2j + 1)l1(2j + 1)IO(2j + 1 enter a rate-3/4 inverse syndrome former (H 1 ) T 
254 which produces two pairs of S-bits; S1(2j)S0(2j) and S1 <2j + 1 )S0(2j + 1 ). As discussed earlier, (H 1 ) T is 

30 included to limit error propagation. 

The 16 bits generated in this manner are mapped into two initial signal points r(2j) and r(2j + 1 ^chosen 
from a 256-point two-dimensional square signal set which lies on the grid 2" 3 Z 2 + (2-\2- 4 ). First.'the six- 
tuple Z-bits are used to select a signal point from the 64-point quadrant-1 signal constellation shown in Fig. 
21. This signal set is divided into four subsets (262, 264, 266, 268) which are cosets of 2 2 Z 2 . These are 

35 indicated by different shadings in Fig. 21. The signal points are labeled such that Z1 and Z0 select one of 
the cosets according to the labeling shown in the lower left corner of Fig. 21. The labeling of the remaining 
bits Z5Z4Z3Z2 is shown only for coset 262. For other cosets, the labeling can be obtained from the rule 
that 90 degree rotation of a point around the center (1/2, 1/2) of the quadrant-1 signal points results in the 
same labeling. 

40 The Wei encoder ensures that the minimum distance between allowable sequences of quadrant-1 
points is kept large. 

The S-bits S1S0 determine the final quadrant of the initial point according to the rule: 



S1S0 


Quadrant 


00 


1 


01 


4 


10 


2 


11 


3 



50 

Referring to Fig. 22, the signal points in quadrant 1 272 are moved to the quadrant chosen by the S-bits 
such that they remain in the same coset of Z 2 . This is done by offseting the quadrant-1 point by (0,0), (0,- 
1), (-1,-1) or (-1,0). It follows that the translated final point is in the same coset of 2~ 2 Z 2 as the quadrant-1 
55 point Therefore, the minimum distance between allowable sequences remains unchanged. The signal 
points obtained in this manner form the initial point sequence r. 

Referring to Fig. 23, the sequence r is transformed into the transmitted sequence e by the decoder 294. 
To understand the function of the decoder 294, we must first consider the shaping code C s . All possible 
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sequences in this code are described by the trellis diagram of Fig. 24. 

In Fig. 22, the code symbols chosen from the two-dimensional lattice Z 2 are indicated by squares, the 
only nine symbols are shown since others are irrelevant. These symbols are partitioned into four subsets 
which correspond to the (four) cosets of 2Z 2 in Z 2 and are labeled as A, B, C and D as shown in the lower- 
s left corner of Fig. 22. 

Referring to Fig. 24, from any state there are two branches, each branch representing two two- 
dimensional cosets. For example, state 0 has two branches labeled A, A and B, B. Each pair of such cosets 
represents a coset of 2Z* in Z 4 . The decoder in Fig. 23 determines, via the Viterbi algorithm, the allowable 
code sequence c which is closest in the sense of squared Euclidean distance to the sequence r of initial 
10 signal points. The decoder operates recursively and in synchronism such that every two signaling intervals 
(or for every recursion) it releases two delayed decisions c(2j-2M) and c(2j + 1-2M), where 2M is the 
decoding delay measured in number of signaling intervals. 

The operation of the Viterbi algorithm is well-known. Here, it is slightly enhanced to insure that the 
computed code sequence c is indeed an allowable sequence from C s , because otherwise the transmitted 
15 sequence cannot be correctly recovered. In each recursion, the Viterbi algorithm determines the most likely 
path and traces it back for 2M signaling intervals to determine the delayed code symbols which in turn 
correspond to a state-transition from some state u to state v, for example. The enhanced Viterbi algorithm 
traces back other paths as well, to determine whether they also end with a state transition that goes to state 
v. A very large metric is assigned to those paths which do not satisfy this requirement. This insures that 
20 they will not become the most likely path in the next recursion and guarantees that only allowable code 
sequences are selected. The enhanced VA may be further modified to include constellation constraints. 

Referring to Fig. 17, the error sequence e = r-c is computed by a subtractor 241 and passed to a QAM 
modulator 243. The resulting pairs of successive QAM symbols are filtered by a transmit filter 245, and 
transmitted to a channel 247. Note that the elements e(2j) and e(2j + 1) of the error sequence lie in the same 
25 coset of Z* ( and thus the same coset of 2~ 2 Z*, in 2~ 3 Z\ as the initial points r(2j) and r(2j + 1). Therefore, 
the minimum distance between possible error sequences is the same as the minimum distance between 
initial sequences, but the required average power is reduced as explained earlier. Also note that the 
elements of the error sequence belong to the 256-point square signal set lying on the grid 2~ 3 Z 2 + - 
(2-* ,2-*). 

30 Referring to Fig. 25, in the receiver 281, a received signal 283 is passed through an equalizer 277, and 
then demodulated 279 to give the received sequence z 280 which approximately corresponds to the 
transmitted error sequence corrupted by noise. First, an estimate of the error sequence e' 278 is obtained 
by using a Viterbi algorithm decoder 281 for the 16-state Wei code C c . This decoder is well-known; of 
course, it may take into account the square boundaries of the 256-point constellation from which the 

35 transmitted signal points are chosen. 

Suppose e (2j) and e'(2j + 1) are the two estimates generated by the Viterbi decoder after a recursion. 
The decisions on the transmitted l-bits can be obtained from these as follows: First, to obtain the Z-bits, e'- 
(2j) and e'(2j + 1) are translated into quadrant 1 ( in such a way that they remain in the same coset of Z 2 . 
Then, the Z-bits can be extracted using an inverse mapping in quadrant 1 . 

40 To obtain the S-bits. we first label the quadrants (as before) according to 



Quadrant 


SQ1SQ0 
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00 
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10 
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11 


4 


01 



Then, we obtain two quadrant labels SQl(2j)SQ0(2j) and SQ1(2j + 1)SQ0(2j + 1). It is easy to show that SQ1- 
(i)SQO(i) = S1(i)SO(i)e 2 B1(i)B0(i), where i=2j or 2j + 1 and © 2 represents an exclusive-or operation. B1(i)B0(i) 
represents a binary two-tuple corresponding to the two-dimensional coset (A = 00, B=11, C = 01 or D = 10) 
of 2Z 2 for the shaping code symbol c(i) chosen by the decoder for C s ; recall that e(i) = r(i)-c(i). Referring to 
Figs. 26 and 27, the labels SQ1(i)SQ0(i) are then passed through a feedback-free rate-4/3 syndrome-former 
H T 256. The syndrome former has the property that any allowable sequence of coset labels B1(i)B0(i) will 
produce an all-zero sequence at its output. Furthermore, (H" 1 ) T H T =I, the identity matrix. Therefore, a 
sequence which first passes through (H~ 1 ) T and then through H T will be unchanged. Thus, at the output of 
the syndrome former we correctly obtain the transmitted bits I2(2j + 1)11 (2j + 1)IO(2j + 1), as long as the 

22 



BNSDOCID: <EP 0383632A2> 



EP 0 383 632 A2 

estimates e'<i) are correct. When there are occasional errors, they do not create catastrophic error 
propagation, because H T is feedback-free. 

Referring again to Fig. 26, the remaining information bits are recovered by an inverse bit converter 286 
and a coding differential decoder 288 which operates according to the relation: 
5 I2(2j)l1 '(2j) = ID2(2j)ID1 '(2j) e* ID2(2j-2)ID1 '(2j-2) 

where e* is a modulo 4 subtraction. Referring to Fig. 25, the l-bits are then descrambled in a descrambler 
285 and delivered to the DTE. 



70 Differential Coding for Shaping 

Suppose the channel introduces a phase rotation of 90k degrees, k = 1, 2, or 3. The translation of the 
error point into quadrant-1 is then rotated around the point (1/2, 1/2) by the same amount. Since the Wei 
code is transparent to 90k degree phase rotations, the mapping used in the transmitter guarantees that the 
2-bits can be correctly recovered. Unfortunately, this is not true for the quadrant labels SQ1SQ0. 

To remedy this situation, the label sequence SQ1SQ0 can be generated according to the phase- 
invariant labeling shown in Fig. 28. In order to guarantee that the relationship SQ1 SQO = S1 S0e 2 B1 BO still 
holds, a differential encoding operation is necessary. This is done as follows: Referring to Fig. 29, for each 
signal point obtained through quadrant-1 mapping 258, we extract its subquadrant label SQ1SQ0 with a 
subquadrant extractor 306 according to Fig. 30. A shaping differential encoder 308 and a map into offsets 
310 use the bits SQ1SQ0 and S1S0 to offset the quadrant-1 point into two new points r0(i) and r1(i) T where i 
= 2j or i = 2j + 1 such that they remain in the same coset of Z 2 . This mapping can be described by the 
following two tables: 
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50 



DiO 


Dil 


A i 
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0 


O.O+jO.O 


0 


1 


0.0-j1.0 
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0 


-1.0+j0.0 


1 


1 
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In the Viterbi algorithm, the point r0(i) is used for branches corresponding to cosets A and B, whereas 
r1(i) is used for branches corresponding to cosets C and D. It can be shown that if SQ1SQ0 is the 
subquadrant label of the received point, then SQ1SQO-S1SO© 2 B1BO. Therefore, by passing the subquadrant 
label (which is rotationally invariant) through the syndrome-former H T we can recover the bits I0(2j + 1), 11- 
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(2j + 1) and I2(2j + 1), even in the presence of 90k degrees phase offset. 



Coding Gain of Wei Embodiment 

5 

The normalized redundance of the coding code is r'(C c ) = 1/2 l so the coding constellation expansion 
ratio is CER c (C c ) = 2 1/2 = 1.414. The normalized informativity of the shaping code is k(C s ) = 1/2, so the 
shaping constellation expansion ratio is CET s (C s ) = 2 1/2 = 1.414. As previously noted, simulations have shown 
that the shape gain of this code exceeds 1.0 d*B as the shaping decoding delay M becomes large. 
10 The combined effects of coding and shaping yield a total constellation expansion ratio of CER S (C S )- 
*CER C (C C ) = 2. The nominal coding gain of this code is 2 3/2 (4.52 dB), and its effective coding gain is about 
4.2 dB, since its normalized error coefficient is 12. Thus, the total coding gain due to the code and the 
constellation is about 5.2 dB. 

15 

Bounds on shape gain of trellis constellations 

As discussed earlier, the constituent 2D constellation C2 of the final constellation C fin of a trellis 
constellation based on a trellis code C(A/a'; C) satisfies the following constraints (where we assume that Z 2 
20 is the constituent 2D lattice of A, and R U Z 2 is the constituent 2D sublattice of a', where u is the depth of the 
partition A/a'): 

1 . The points of C2 are drawn from a translate of a superlattice R' m * u Z 2 of Z 2 ; 

2. Each of the 2 m + tA cosets of Z 2 in r-^z 2 has equal probability 2"™; that is, the sums of the 
probabilities of the points in the 2 u cosets of R U Z 2 whose unions are each one of these cosets are equal; 

25 3. The points of C2 are bounded within the Voronoi region of R U Z 2 (a square of area 2 U ). 

Fig. 16 shows the maximum shape gain T s (in dB) as a function of the normalized informativity x f for tx 
= 1, 2, 3, and 4, and also for unlimited u (u - °°). Since CER S = 2 X , this curve also gives a bound on 
shape gain versus shaping constellation expansion ratio for trellis constellations, for various u. All curves 
start out (from k = 0) along the u = 00 curve, but then diverge as the Voronoi constraints take effect. 

30 However, even for u = 2, shape gains as large as 1.44 dB are in principle obtainable. (The curves are 
obtained by simulations with 16x16 final 2D constellations, so that the values are approximate, and the 
upper ends of the curves are truncated due to boundary effects.) 

For x = 0.5 bits per two dimensions, where the shaping constellation expansion ratio is CER S = 2 1 ' 2 - 
1.4, the maximum attainable shape gain is 1.35 dB, within 0.2 dB of the ultimate limit, and this may be 

35 achieved with a depth u of 2. For x = 0.25 bits per two dimensions, where the constellation expansion ratio 
is 2 1/4 - 1.2, the maximum attainable shape gain is 1.12 dB, also attainable with u = 2. 

These curves suggest that it is not a serious restriction to consider only linear, and thus depth-2, trellis 
codes for shaping. They further suggest that a value of as low as x = 1/2 for normalized informativity, such 
as in the dual Wei 4D codes, may not seriously compromise performance. Indeed, a shape gain of 1.10 dB 

40 is obtained asymptotically with the 16-state dual Wei 4D code, which is within 0.25 dB of the upper bound 
on shape gain for this x. The bound suggests that we could go to even smaller x and thus constellation 
expansion ratio 2 X \ even for x = 1/4, we might be able to achieve of the order of 1 dB in shape gain. 



45 Extension to Nonlinear Trellis Codes 

While the previous section shows that linear (mod-2) trellis codes can achieve most of the shape gain 
possible, we shall now sketch briefly how the methods of this specification may be extended to nonlinear 
trellis codes C. 

50 For known practical nonlinear trellis codes C, there is still an associated time-zero lattice such that for 
any state Sj of the encoder, the set of possible next signal points is a coset Ao + a(sj) of the time-zero lattice 
Ao of C. 

A simple fundamental region of C may then again be specified by a hard-decision decoder for C that is 
based on any fundamental region RcTof the time-zero lattice Ao of C. Given the state Sj at time j and any rj 
55 in R N , this decoder finds the uniq"ue code symbol Cj in A 0 + a(sj) "such that rj = Cj + ej, where ej is an 
element of Ro. The next state Sj + i is then determined by cj. Thus this hard-decision decoder decomposes 
any sequence r in sequence space (R N ) W into a sum r = c + e, where c is in C and e is in (R0) 00 . 

We therefore say that (R0) 00 is a simple fundamental region of the code C. The whole of sequence 
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space (R )°° is covered by the nonoverlapping translates (Ro)°° + c, where c ranges through all code 

sequences in C. /d nvoo 

Consequently we may define the congruence class [e] as the set of all r in sequence space (R ) that 
are mapped to e by the above decoder, where e is any sequence in the simple fundamental region (Ro) 00 . 
This is an appropriate generalization of the ideal of a coset or translate C + e of a linear trellis code C, 
although the congruence classes do not in general form an algebraic group if C is nonlinear. 

The general implementation of a trellis shaping system as shown in Fig. 3 et seq follows as before. The 
data sequence is first mapped into an initial sequence i (r in Fig. 3) that lies in the simple fundamental 
region (Ro) 00 . A decoder for C then finds a code sequence c in C such that the difference sequence x = i-c(e 
in Fig. 3)* has an average power less than or equal to that of the initial sequence i. The difference sequence 
is transmitted. Note that x is in the congruence class [i]. Therefore the initial sequence i can be recovered 
from the transmitted sequence x by using the hard-decision decoder for C based on the fundamental region 
Ro of the time-zero lattice of C that was first described above. 

" In general, we may define a fundamental region of a trellis code C as any region that contains one and 
only one sequence from each congruence class [e], for each e in (Ro) 00 . In the implementation of the 
previous paragraph, the set of possible transmitted sequences x lies within such a fundamental region of C, 
because any decoder for C maps the initial sequence i (which is a sequence in (R 0 )°°) into a transmitted 
sequence x in the congruence class [i]. A minimum-distance decoder for C would map i into the ^sequence 
x of least average power in the congruence class [i]. The set of such sequences x for ail i in (R 0 )°° may be 
defined as the Voronoi region of the code C. 



Hexagonal 2D Constellations 



If we choose the shaping trellis code C(A/a';C) as a code based on a partition a/a of so-called ternary 
or quaternary lattices whose constituent 2D lattice and sublattice A 2 and A 2 are versions of the hexagonal 
2D lattice A 2 . then the final constellation C„„ is bounded within a region R V (A 2 ) which is hexagonal rather 
than square. Such a constellation has the advantage of being more nearly spherical. 



Claims 

1. A method of mapping a digital data sequence into a signal point sequence for data transmission, 
comprising selecting said signal point sequence from a subset of all possible signal point sequences based 
on said digital data sequence, all said possible signal point sequences in said subset lying in a fundamental 
region of a trellis code, said fundamental region being other than a simple Cartesian product of finite- 
dimensional regions. 

2. The method of claim 1 wherein said possible signal point sequences are code sequences from a 
translate of a second code, said second code being of the trellis or lattice type. 

3. The method of claim 1 wherein said fundamental region comprises approximately a Voronoi region of 
said trellis code. 

4. The method of claim 1 wherein said fundamental region comprises the set of said possible signal 
point sequences that are decoded to the zero sequence in said trellis code by some decoder for said code. 

5. The method of claim 4 wherein said fundamental region comprises the set of said possible signal 
point sequences that are decoded to the zero sequence in said trellis code by an approximation to a 
minimum distance decoder for said code. 

6. The method of claim 5 wherein said fundamental region comprises the set of said possible signal 
point sequences that are decoded to the zero sequence in said trellis code by a minimum distance decoder 
with delay M, wherein M is greater than or equal to 1 . 

7. The method of claim 4 wherein said fundamental region comprises the set of said possible signal 
point sequences that are decoded to a common error region by a fair, exhaustive decoder for said trellis 
code. 

8. The method of any preceding claim wherein said selecting comprises 

mapping said digital data sequence into an initial signal point sequence belonging to and representing a 
congruence class of said trellis code, and 

choosing a signal point sequence belonging to said congruence class and which has no greater average 
power than said initial signal point sequence. 
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9. The method of claim 8 wherein said mapping comprises mapping said digital data sequence into a 
sequence of signal points belonging to an initial constellation comprising points lying within a fundamental 
region of a time-zero lattice of said trellis code. 

10. The method of claim 8 wherein said mapping includes applying a portion of the elements of said 
5 digital data sequence to a coset representative generator for forming a larger number of digital elements 

representing a coset representative sequence. 

11. The method of claim 10 wherein said coset representative generator comprises a multiplication of a 
portion of the elements of said digital data sequence by a coset representative generator matrix (H" 1 ) 7 
which is inverse to a syndrome-former matrix H T for said code. 

w 12. The method of claim 11 further comprising recovering the digital data sequence from a possibly 
noise-corrupted version of the signal point sequence, including decoding the signal point sequence to a 
sequence of estimated digital elements and forming a syndrome of fewer digital elements based on a 
portion of the estimated digital elements using a feedback-free syndrome former H T . 

13. A method of mapping a digital data sequence into a sequence of signal points to be sent 

75 comprising 

determining a class of possible sequences from which to select said sequence to be sent, based on digital 
data appearing in said digital data sequence up to a time j, and 

for every time j, selecting said sequence to be sent from said class of possible sequences based on digital 
data appearing in said sequence after time j. 
20 14. The method of claim 13 in which 

the digital data sequence comprises a succession of data elements, and 

the step of determining the class of possible sequences comprises mapping each element into a point in an 
initial constellation. 

15. The method of claim 14 in which the sequence of said points in said initial constellation into which 
25 said succession of data elements are mapped comprise a representative sequence of said class of possible 

sequences. 

16. The method of claim 14 in which the mapping of each element is done independently of the 
mapping of any other element. 

17. The method of claim 14 in which each element is a binary value and said initial constellation 
30 comprises 2 b points, where b is the number of bits in each said element of said sequence. 

18. The method of claim 14 in which said initial constellation comprises a portion of a lattice or its 
translate. 

19. The method of claim 18 in which said portion comprises a fundamental region of a sublattice of said 
lattice. 

35 20. The method of claim 19 in which said sublattice is a time-zero lattice of a trellis code. 

21. The method of claim 14 in which the step of selecting said sequence to be sent comprises a 
decoding of a said sequence of points in said initial constellation by a decoder for a trellis code. 

22. The method of claim 21 in which said fundamental region comprises a common error region of said 
decoder, and said decoding comprises decoding said sequence of points in said initial constellation to error 

40 sequences within said common error region. 

23. The method of claim 22 in which the sequences within said common error region comprise signal 
points that comprise a final constellation with more points than said initial constellation. 

24. The method of claim 21 in which said representative points are decoded by a minimum-distance 
decoder for said code. 

45 25. The method of claim 24 wherein said minimum-distance decoder has decoding delay M and M 2: 1. 

26. A method of recovering a digital data sequence from a received sequence consisting of the 
sequence of signal points chosen to be sent in accordance with the method of claim 2 or 14, comprising 
decoding the sequence of sent signal points to recover the sequence of points of the initial constellation, 
and 

so inverse mapping the sequence of points of the initial constellation to recover the digital data sequence. 

27. The method of claim 26 in which decoding the sequence of sent signal points is done at time j 
based on said sent points only up to time j. 

28. The method of claim 1 , 20 or 21 wherein said trellis code is a linear trellis code. 

29. The method of claim 1, 20, or 21 wherein said trellis code is a non-linear trellis code. 
55 30. The method of claim 28 wherein said linear trellis code is a 4-state Ungerboeck code. 

31 . The method of claim 28 wherein said linear trellis code is a dual Wei code. 

32. The method of claim 1 t 20. or 21 wherein said trellis code is based on a partition of binary lattices. 

33. The method of claim 1, 20 or 21 wherein said trellis code is based on a partition of ternary or 
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quaternary lattices. 

34. The method of claim 8 or 14 in which the mapping of data elements to points in the initial 
constellation is linear or distance invariant. 

35. A method of mapping a digital data sequence into a sequence of signal points to be sent, 
5 comprising 

making a selection of the sequence of signal points to be sent from a class of possible sequences specified 
by said digital data, said selection being based on the average power of said different possible sequences 
of said class, said selection being based on not only a finite block of the digital data. 

36. Apparatus for mapping a digital data sequence into a signal point sequence for data transmission, 
70 comprising 

a sequence selector for selecting said signal point sequence from a subset of ail possible signal point 
sequences based on said digital data sequence, all said possible signal point sequences in said subset 
lying in a fundamental region of a trellis code, said fundamental region being other than a simple Cartesian 
product of finite-dimensional regions. 
75 37. Apparatus for mapping a digital data sequence into a sequence of signal points to be sent, 
comprising 

a mapper for determining a class of possible sequences from which a select said sequence to be sent, 
based on digital data appearing in said digital data sequence up to a time j, and 

a sequence selector, for every time j, said sequence to be sent from said class of possible sequences 
20 based on digital data appearing in said sequence after time j. 

38. The method of claim 1, 13, or 36 wherein the step of selecting said signal point sequence is jurther 
constrained so as to reduce the peak power of said signal point sequence where said peak^ power 
represents the maximum energy of said signal point sequence in some number of dimensions N. 

39. The method of claim 38 wherein N = 2. 
25 40. The method of claim 38 wherein N = 4. 

41. The method of claim 1 wherein the signal points in said sequence belong to a 2D constellation and 
the step of selecting said signal point sequence is constrained so that the signal points in said sequence 
will usually be within some radius R c of the origin of said 2D constellation. 

42. The method of claim 8 wherein the step of choosing a signal point sequence belonging to said 
30 congruence class is further constrained so as to reduce the peak power of said signal point sequence 

where said peak power represents the maximum energy of said signal point sequence in some number of 
dimensions N. 

43. The method of claim 42 wherein N = 2. 

44. The method of claim 42 wherein N = 4. 

35 45. The method of claim 8 wherein the step of choosing a signal point sequence belonging to said 
congruence class comprises decoding said initial signal point sequence into a final signal point sequence 
comprising signal points belonging to a 2D constellation, said decoding being constrained so that only final 
signal point sequences whose signal points usually have magnitudes no greater than some predetermined 
radius R c from the origin of said constellation are used. 

40 46. The method of claim 2 wherein the step of choosing a signal point sequence belonging to said 
congruence class comprises 

decoding said initial signal point sequence into a code sequence using a Viterbi algorithm, and 

in each recursion of the Viterbi algorithm, effecting an operation that will assure that said code sequence is 

an allowable sequence in said second code. 
45 47. The method of claim 46 wherein said operation comprises adjusting the metrics of selected 

historical paths in the trellis of said Viterbi algorithm, so that none of said selected paths will become the 

most likely path in the next recursion of the Viterbi algorithm. 

48. The method of claim 47 in which said historical paths are chosen based on whether they include 

particular state transitions at particular locations in the trellis of the Viterbi algorithm. 
so 49. The method of claim 47 in which said operation comprises assigning a large metric to selected 

historical paths in said trellis. 

50. The method of claim 1, 13, or 35, wherein said step of mapping said digital data sequence into said 
signal point sequence is arranged to ensure that said digital data sequence can be recovered from a 
channel-affected version of said signal point sequence which has been subjected to one of a number of 

55 predetermined phase rotations. 

51 . The method of claim 9 wherein said step of mapping said digital data sequence into a sequence of 
signal points belonging to an initial constellation includes converting said data elements in said data 
sequence into groups of bits for selecting signal points from said initial constellation, and said groups of bits 
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are arranged to ensure that said bits can be recovered from a channel-affected version of said transmitted 
sequence which has been subjected to phase rotations of one. two, or three times 90 degrees. 

52. A modem for transmitting and receiving digital data sequences via a channel comprising 
means for mapping a said digital data sequence into a sequence of signal points to be sent, including a 
s sequence selector for selecting said signal point sequence from a subset of all possible signal point 
sequences based on said digital data sequence, all said possible signal point sequences in said subset 
lying in a fundamental region of a trellis code, said fundamental region being other than a simple Cartesian 
product of finite-dimensional regions, 

a modulator for sending said signal points of said sequence via said channel, 
70 a demodulator for receiving a possibly channel-affected version of said signal point sequence from said 
channel, and 

means for recovering a digital data sequence from said possibly channel-affected version of said signal 
point sequence. 

75 
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